Commits · 81b9251d5996ec89bcc016c29ecc0b5f0204e59b · Kirill Smelkov / cpython

29 Apr, 2002 1 commit

Mostly in SequenceMatcher.{__chain_b, find_longest_match}: · 81b9251d

Tim Peters authored Apr 29, 2002

This now does a dynamic analysis of which elements are so frequently
repeated as to constitute noise.  The primary benefit is an enormous
speedup in find_longest_match, as the innermost loop can have hundreds
of times fewer potential matches to worry about, in cases where the
sequences have many duplicate elements.  In effect, this zooms in on
sequences of non-ubiquitous elements now.

While I like what I've seen of the effects so far, I still consider
this experimental.  Please give it a try!
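
A minimal sketch of the effect, using present-day Python 3 difflib rather than the code in this commit. It assumes the `autojunk` parameter and the `bpopular` attribute, which were added to `SequenceMatcher` in later releases; the thresholds used today may differ from those in this commit. The sequences `a` and `b` are made up for illustration.

```python
from difflib import SequenceMatcher

# b has 250 elements: "x" repeated 150 times, then 100 distinct tokens.
b = ["x"] * 150 + [str(i) for i in range(100)]
a = ["x"] * 5 + ["10", "11", "12"]

# autojunk defaults to True: very frequent elements of b are set aside.
with_heuristic = SequenceMatcher(None, a, b)
# autojunk=False keeps every element of b as a potential match start.
without_heuristic = SequenceMatcher(None, a, b, autojunk=False)

popular = with_heuristic.bpopular
match_with = with_heuristic.find_longest_match(0, len(a), 0, len(b))
match_without = without_heuristic.find_longest_match(0, len(a), 0, len(b))

print("popular elements:", popular)
print("longest match with heuristic:   ", match_with)
print("longest match without heuristic:", match_without)
```

With the heuristic on, the run of "x" elements is not considered as a starting point, so the matcher zooms in on the rarer tokens instead.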


28 Apr, 2002 4 commits

Just added comments, and cleared some XXX questions, related to int · 29c0afcf
Tim Peters authored Apr 28, 2002
```
memory management.
```
_PyObject_DebugCheckAddress(): If the leading pad bytes are corrupt, · 449b5a8d
Tim Peters authored Apr 28, 2002
```
display a msg warning that the count of bytes requested may be bogus,
and that a segfault may happen next.
```

Moving pymalloc along. · 8b078f95

Tim Peters authored Apr 28, 2002

As threatened, PyMem_{Free, FREE} also invoke the object deallocator now
when pymalloc is enabled (well, it does when pymalloc isn't enabled too,
but in that case "the object deallocator" is plain free()).

This is maximally backward-compatible, but it leaves a bitter aftertaste.

Also massive reworking of comments.


_PyObject_GC_New: Could call PyObject_INIT with a NULL 1st argument. · fa8efab3
Tim Peters authored Apr 28, 2002
```
_PyObject_GC_NewVar:  Could call PyObject_INIT_VAR likewise.

Bugfix candidate.
```

27 Apr, 2002 2 commits

Repair widespread misuse of _PyString_Resize. Since it's clear people · 5de9842b

Tim Peters authored Apr 27, 2002

don't understand how this function works, also beefed up the docs.  The
most common usage error is of this form (often spread out across gotos):

	if (_PyString_Resize(&s, n) < 0) {
		Py_DECREF(s);
		s = NULL;
		goto outtahere;
	}

The error is that if _PyString_Resize runs out of memory, it automatically
decrefs the input string object s (which also deallocates it, since its
refcount must be 1 upon entry), and sets s to NULL.  So if the "if"
branch ever triggers, it's an error to call Py_DECREF(s):  s is already
NULL!  A correct way to write the above is the simpler (and intended)

	if (_PyString_Resize(&s, n) < 0)
		goto outtahere;

Bugfix candidate.


SF patch 549375: Compromise PyUnicode_EncodeUTF8 · 602f740b

Tim Peters authored Apr 27, 2002

This implements ideas from Marc-Andre, Martin, Guido and me on Python-Dev.

"Short" Unicode strings are encoded into a "big enough" stack buffer,
then exactly as much string space as they turn out to need is allocated
at the end.  This should have speed benefits akin to Martin's "measure
once, allocate once" strategy, but without needing a distinct measuring
pass.

"Long" Unicode strings allocate as much heap space as they could possibly
need (4 x # Unicode chars), and do a realloc at the end to return the
untouched excess.  Since the overallocation is likely to be substantial,
this shouldn't burden the platform realloc with unusably small excess
blocks.

Also simplified uses of the PyString_xyz functions.  Also added a release-
build check that 4*size doesn't overflow a C int.  Sooner or later, that's
going to happen.
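
A minimal Python sketch of the "long string" strategy described above, not the C implementation from this patch. It assumes Python 3 `str` semantics, where each code point needs at most 4 UTF-8 bytes; `encode_utf8_overallocate` is a hypothetical name, and per-character encoding is delegated to `str.encode` to keep the sketch short.

```python
def encode_utf8_overallocate(text):
    """Encode text by overallocating the worst case, then trimming the excess."""
    # Worst case: 4 bytes per character, so this buffer is always big enough.
    buf = bytearray(4 * len(text))
    pos = 0
    for ch in text:
        encoded = ch.encode("utf-8")
        buf[pos:pos + len(encoded)] = encoded  # same-length slice assignment
        pos += len(encoded)
    # Return the untouched excess, analogous to the final realloc.
    del buf[pos:]
    return bytes(buf)


sample = "h\u00e9llo \u20ac \U0001F600"
result = encode_utf8_overallocate(sample)
print(len(sample), "characters ->", len(result), "bytes")
```

The single trim at the end stands in for the one realloc; there is no separate measuring pass over the input.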


26 Apr, 2002 11 commits

Teach the Windows build about the new enumobject.c file. · 73364e64
Tim Peters authored Apr 26, 2002

Slightly expand and clarify the differences between getegid(), getgid(), · d3e66787
Fred Drake authored Apr 26, 2002
```
getpgrp(), and setpgid().
This closes SF bug #547939.
```
Be more consistent, both internally and with recommended practice. · a7bb2b9b
Fred Drake authored Apr 26, 2002
```
This closes SF bug #547953.
```
Documentation for the enumerate() function/type. · 38f71973
Fred Drake authored Apr 26, 2002
```
This closes SF patch #547162.
```
Clarify that the strip changes also apply to Unicode. · 26dd8301
Guido van Rossum authored Apr 26, 2002


- New builtin function enumerate(x), from PEP 279. Example: · 7dab2426

Guido van Rossum authored Apr 26, 2002

  enumerate("abc") is an iterator returning (0,"a"), (1,"b"), (2,"c").
  The argument can be an arbitrary iterable object.
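
A short illustration of the behaviour described above, written for present-day Python 3. The generator `letters` is a made-up example of an arbitrary iterable.

```python
# enumerate() pairs each item with its index, starting at 0.
pairs = list(enumerate("abc"))


def letters():
    """A hypothetical generator, to show that any iterable works."""
    yield "x"
    yield "y"


generator_pairs = list(enumerate(letters()))

for index, char in enumerate("abc"):
    print(index, char)
```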


(py-comint-output-filter-function): Put the pop-to-buffer call inside · 17afa13a
Barry Warsaw authored Apr 26, 2002
```
the `when' condition so other non-Python shell comint changes won't
cause random buffers to pop.
```

PyNumber_CoerceEx: this took a shortcut (not doing anything) when the · 517c7d4f

Guido van Rossum authored Apr 26, 2002

left and right operands were of the same type and not classic instances.

This shortcut is dangerous for proxy types, because it means that
coerce(Proxy(1), Proxy(2.1)) leaves Proxy(1) unchanged rather than
turning it into Proxy(1.0).

In an ever-so-slight change of semantics, I now only take the shortcut
when the left and right operands are of the same type and don't have the
CHECKTYPES feature.  It so happens that classic instances have this
flag, so the shortcut is still skipped in this case (i.e. nothing
changes for classic instances).  Proxies also have this flag set
(otherwise implementing numeric operations on proxies would become
nightmarish) and this means that the shortcut is also skipped there,
as desired.  It so happens that int, long and float also have this
flag set; that means that e.g. coerce(1, 1) will now invoke
int_coerce().  This is fine: int_coerce() can deal with this, and I'm
not worried about the performance; int_coerce() is only invoked when
the user explicitly calls coerce(), which should be rarer than rare.


Clean up uses of some deprecated features. · d451ec1c
Fred Drake authored Apr 26, 2002
```
Reported by Neal Norwitz on python-dev.
```
If Py_OptimizeFlag is false then always evaluate assert conditions, don't · 89e3ee0c
Neil Schemenauer authored Apr 26, 2002
```
test __debug__ at runtime.  Closes SF patch #548833.
```
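
A small sketch of the user-visible behaviour, assuming present-day Python 3: when the interpreter is not run with -O, `__debug__` is true and assert conditions are always evaluated; under -O the assert statements are compiled away. `checked_sqrt` and `assertion_fires` are hypothetical helpers.

```python
def checked_sqrt(x):
    # Evaluated whenever the interpreter runs without -O.
    assert x >= 0, "x must be non-negative"
    return x ** 0.5


def assertion_fires(value):
    """Report whether the assert in checked_sqrt() was evaluated and failed."""
    try:
        checked_sqrt(value)
    except AssertionError:
        return True
    return False


print("__debug__ is", __debug__)
print("assert fired for -1:", assertion_fires(-1))
```
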

Make sure that tp_free frees the int the same way as tp_dealloc would. · 93646981

Guido van Rossum authored Apr 26, 2002

This fixes the problem that Barry reported on python-dev:
   >>> 23000 .__class__ = bool
crashes in the deallocator.  This was because int inherited tp_free
from object, which uses the default allocator.

2.2. Bugfix candidate.


25 Apr, 2002 9 commits

Fix typo in the setup of interpreter-mode-alist. · cf22c826
Barry Warsaw authored Apr 25, 2002


SF patch #510288 by Kevin J. Butler, mod'd by Barry. This provides · 88491611

Barry Warsaw authored Apr 25, 2002

better auto-recognition of a Jython file vs. a CPython (or agnostic)
file by looking at the #! line more closely, and inspecting the import
statements in the first 20000 bytes (configurable).  Specifically,

(py-import-check-point-max): New variable, controlling how far into
the buffer it will search for import statements.

(py-jpython-packages): List of package names that are Jython-ish.

(py-shell-alist): List of #! line programs and the modes associated
with them.

(jpython-mode-hook): Extra hook that runs when entering jpython-mode
(what about Jython mode? <20k wink>).

(py-choose-shell-by-shebang, py-choose-shell-by-import,
py-choose-shell): New functions.

(python-mode): Use py-choose-shell.

(jpython-mode): New command.

(py-execute-region): Don't use my previous hacky attempt at doing
this, use the new py-choose-shell function.

One other thing this file now does: it attempts to add the proper
hooks to interpreter-mode-alist and auto-mode-alist if they aren't
already there.  Might help with Emacs users since that editor doesn't
come with python-mode by default.


Clean up the layout of the bool_as_number struct initializer. · c95cc874
Guido van Rossum authored Apr 25, 2002


(py-execute-region): Alexander Schmolck points out that leading · 4aab68e1

Barry Warsaw authored Apr 25, 2002

whitespace can hose the needs-if test.  So just skip all blank lines
at the start of the region right off the bat.


Pass the full pathname to MSVC when compiling a debug version. This · 69d31b74
Thomas Heller authored Apr 25, 2002
```
allows the debugger to find the source without asking the user to
browse for it.
```
Append the PC specific include 'PC' and library 'PCBuild' directories · 378498da
Thomas Heller authored Apr 25, 2002
```
under NT - this allows distutils to work with the CVS version or the
source distribution.

Wrap a long line.
```
Fix trivial typo. · e6500803
Thomas Heller authored Apr 25, 2002

(py-comint-output-filter-function): Add a pop-to-buffer call so you · 13caba30
Barry Warsaw authored Apr 25, 2002
```
always get to see the result of e.g. a py-execute-region.  Funny, this
bugged both me /and/ Guido!
```

(py-shell-hook): A new hook variable, run at the end of py-shell. · 56bd2ede

Barry Warsaw authored Apr 25, 2002

Allows for some customization of the underlying comint buffer.

(py-shell): Call the new hook.

(info-lookup-maybe-add-help): A new call suggested by Milan Zamazal to
make lookups in the Info documentation easier.


24 Apr, 2002 3 commits

Regenerated. · 65300f17
Jack Jansen authored Apr 24, 2002
```
Bugfix candidate.
```
Fix a small mistake and complete some function prototypes. · 8da4b592
Thomas Heller authored Apr 24, 2002
```
SF Patch #547813.
```
Add more tests for abstract isinstance() and issubclass(). · 3b04d635
Neil Schemenauer authored Apr 24, 2002

23 Apr, 2002 10 commits

test_resource has no chance of running on Windows. · 1e33ffa5
Tim Peters authored Apr 23, 2002


test_mmap started breaking on Windows, only when run after test_bsddb. · e12cda98

Tim Peters authored Apr 23, 2002

On Win2K it thought 'foo' started at byte offset 0 instead of at the
pagesize, and on Win98 it thought 'foo' didn't exist at all. Somehow
or other this is related to the new "in memory file" gimmicks in
bsddb, but the old bsddb we use on Windows sucks so bad anyway I don't
want to bother digging deeper. Flushing the file in test_mmap after
writing to it makes the problem go away, so good enough.
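
A hedged sketch of the workaround pattern in present-day Python 3, not the actual test_mmap code: write the data, flush the file object, and only then map the file. `find_after_flush` is a hypothetical helper and the file layout (one page of padding followed by the needle) is an assumption modelled on the description above.

```python
import mmap
import tempfile


def find_after_flush(needle=b"foo"):
    """Write one page of padding plus needle, flush, then search via mmap."""
    with tempfile.TemporaryFile() as f:
        f.write(b"\0" * mmap.PAGESIZE)
        f.write(needle)
        # Flush buffered writes before mapping, so the mapping sees them.
        f.flush()
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mapped:
            return mapped.find(needle)


offset = find_after_flush()
print("needle found at offset", offset, "page size is", mmap.PAGESIZE)
```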


Unit tests for the changes in abstract.c version 2.101. The debug · 906569de

Barry Warsaw authored Apr 23, 2002

build's "undetected error" problems were originally detected with
extension types, but we can whitebox test the same situations with
new-style classes.


Regenerated. · 033b79c4
Jack Jansen authored Apr 23, 2002


abstract_get_bases(): Clarify exactly what the return values and · f16951cf

Barry Warsaw authored Apr 23, 2002

states can be for this function, and ensure that only AttributeErrors
are masked.  Any other exception raised via the equivalent of
getattr(cls, '__bases__') should be propagated up.

abstract_issubclass(): If abstract_get_bases() returns NULL, we must
call PyErr_Occurred() to see if an exception is being propagated, and
return -1 or 0 as appropriate.  This is the specific fix for a problem
whereby if getattr(derived, '__bases__') raised an exception, an
"undetected error" would occur (under a debug build).  This nasty
situation was uncovered when writing a security proxy extension type
for the Zope3 project, where the security proxy raised a Forbidden
exception on getattr of __bases__.

PyObject_IsInstance(), PyObject_IsSubclass(): After both calls to
abstract_get_bases(), where we're setting the TypeError if the return
value is NULL, we must first check to see if an exception occurred,
and /not/ mask an existing exception.

Neil Schemenauer should double check that these changes don't break
his ExtensionClass examples (there aren't any test cases for those
examples and abstract_get_bases() was added by him in response to
problems with ExtensionClass).  Neil, please add test cases if
possible!

I believe this is a bug fix candidate for Python 2.2.2.
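
A minimal sketch of the behaviour described above, as it can be observed from present-day Python 3 rather than from the C code in this commit. `Forbidden`, `ForbiddingProxy`, `NoBases`, and `classify` are hypothetical names; the proxy only imitates the Zope3 security proxy by raising on access to `__bases__`.

```python
class Forbidden(Exception):
    """Hypothetical stand-in for the security proxy's exception."""


class ForbiddingProxy:
    @property
    def __bases__(self):
        # Any exception other than AttributeError should propagate.
        raise Forbidden("access to __bases__ denied")


class NoBases:
    """Ordinary instance: looking up __bases__ raises AttributeError."""


def classify(obj):
    """Describe what issubclass() does with a non-class first argument."""
    try:
        issubclass(obj, int)
    except Forbidden:
        return "Forbidden propagated"
    except TypeError:
        return "TypeError (AttributeError was masked)"
    return "no error"


print(classify(ForbiddingProxy()))
print(classify(NoBases()))
```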


Converted to use re instead of regex and regsub (finally:-). · 3adf8d1d
Jack Jansen authored Apr 23, 2002

Rewrote the PyUnit description so that it now recommends using · 5ca53747
Barry Warsaw authored Apr 23, 2002
```
run_suite() instead of run_unittest().  Best practice is to plan for
multiple test classes.
```
Add text about circular references caused by storing frames in local · 99d17006
Fred Drake authored Apr 23, 2002
```
variables.  This closes SF bug #543148.
```
Second part of fix for #493826: regenerated suite modules so errn exists but... · 95df3fd1
Jack Jansen authored Apr 23, 2002
```
Second part of fix for #493826: regenerated suite modules so errn exists but == 0 doesn't signal an error.

Bugfix candidate.
```

First part of fix for #493826: if 'errn' key exists in return value this... · 18983536

Jack Jansen authored Apr 23, 2002

First part of fix for #493826: if 'errn' key exists in return value this doesn't necessarily signal an error; it does so only if the value is non-zero. This
does not correspond with my reading of the documentation, but the OSX Finder can return 'errn'=0, and it knows better than me:-)

Bugfix candidate.
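
The rule above can be sketched as follows. This is an illustration only: `reply` is a plain dict standing in for a decoded reply, and `reply_signals_error` is a hypothetical helper, not a function from the Mac modules touched by this fix.

```python
def reply_signals_error(reply):
    """Return True only when the reply carries a non-zero 'errn' value."""
    # A missing key and an explicit 0 are both treated as success.
    return reply.get("errn", 0) != 0


print(reply_signals_error({"errn": 0}))
print(reply_signals_error({"errn": -1708}))
print(reply_signals_error({}))
```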
