Commits · 2f228e75e4d5ac8c3eb4a6334dbc43243bff1095 · Kirill Smelkov / cpython

13 May, 2001 1 commit

Get rid of the superstitious "~" in dict hashing's "i = (~hash) & mask". · 2f228e75

Tim Peters authored May 13, 2001

The comment following used to say:
	/* We use ~hash instead of hash, as degenerate hash functions, such
	   as for ints <sigh>, can have lots of leading zeros. It's not
	   really a performance risk, but better safe than sorry.
	   12-Dec-00 tim:  so ~hash produces lots of leading ones instead --
	   what's the gain? */
That is, there was never a good reason for doing it.  And to the contrary,
as explained on Python-Dev last December, it tended to make the *sum*
(i + incr) & mask (which is the first table index examined in case of
collison) the same "too often" across distinct hashes.

Changing to the simpler "i = hash & mask" reduced the number of string-dict
collisions (== # number of times we go around the lookup for-loop) from about
6 million to 5 million during a full run of the test suite (these are
approximate because the test suite does some random stuff from run to run).
The number of collisions in non-string dicts also decreased, but not as
dramatically.

Note that this may, for a given dict, change the order (wrt previous
releases) of entries exposed by .keys(), .values() and .items().  A number
of std tests suffered bogus failures as a result.  For dicts keyed by
small ints, or (less so) by characters, the order is much more likely to be
in increasing order of key now; e.g.,

>>> d = {}
>>> for i in range(10):
...    d[i] = i
...
>>> d
{0: 0, 1: 1, 2: 2, 3: 3, 4: 4, 5: 5, 6: 6, 7: 7, 8: 8, 9: 9}
>>>

Unfortunately. people may latch on to that in small examples and draw a
bogus conclusion.

test_support.py
    Moved test_extcall's sortdict() into test_support, made it stronger,
    and imported sortdict into other std tests that needed it.
test_unicode.py
    Excluced cp875 from the "roundtrip over range(128)" test, because
    cp875 doesn't have a well-defined inverse for unicode("?", "cp875").
    See Python-Dev for excruciating details.
Cookie.py
    Chaged various output functions to sort dicts before building
    strings from them.
test_extcall
    Fiddled the expected-result file.  This remains sensitive to native
    dict ordering, because, e.g., if there are multiple errors in a
    keyword-arg dict (and test_extcall sets up many cases like that), the
    specific error Python complains about first depends on native dict
    ordering.

2f228e75

12 May, 2001 7 commits
- Got the first MacPython module working under MacOSX/MachO (gestalt). Main changes · 0194ad5c
  Jack Jansen authored May 12, 2001
```
are including Carbon/Carbon.h in stead of the old headers (unless WITHOUT_FRAMEWORKS
is defined, as it will be for classic MacPython) and selectively disabling all the
stuff that is unneeded in a unix-Python (event handling, etc).
```
  0194ad5c
- Be more sensible about when to use TARGET_API_MAC_OS8 in stead of... · 6e68a7e7
  Jack Jansen authored May 12, 2001
```
Be more sensible about when to use TARGET_API_MAC_OS8 in stead of !TARGET_API_MAC_CARBON. This should greatly facilitate porting stuff to OSX in its MachO/BSD incarnation.
```
  6e68a7e7
- Added iterobject.c to the project. And trying my first checkin at the same time. · 1bd0a712
  Jack Jansen authored May 12, 2001
  
  1bd0a712
- Repair "module has no attribute xxx" error msg; bug introduced when · 16cabc0a
  Tim Peters authored May 12, 2001
```
switching from tp_getattr to tp_getattro.
```
  16cabc0a
- Refactored, with some future plans in mind. · e9d7f077
  Guido van Rossum authored May 12, 2001
```
This now uses the new gotofileline() method defined in FileList.py.
```
  e9d7f077
- Move the action of loading the configuration to the IdleConf module · 6cb7a219
  Guido van Rossum authored May 12, 2001
```
rather than the idle.py script.  This has advantages and
disadvantages; the biggest advantage being that we can more easily
have an alternative main program.
```
  6cb7a219
- Delete goodname() method, which is unused. · 8451ebb4
  Guido van Rossum authored May 12, 2001
```
Add gotofileline(), a convenience method which I intend to use in a
variant.

Rename test() to _test().
```
  8451ebb4
11 May, 2001 27 commits
- Variant of patch #423262: Change module attribute get & set · d85e1023
  Tim Peters authored May 11, 2001
```
Allow module getattr and setattr to exploit string interning, via the
previously null module object tp_getattro and tp_setattro slots.   Yields
a very nice speedup for things like random.random and os.path etc.
```
  d85e1023
- Fix a minor style consistency issue. · 564a6cc8
  Fred Drake authored May 11, 2001
```
When getting a string buffer for a string we just created, use
PyString_AS_STRING() instead of PyString_AsString() to avoid the
call overhead and extra type check.
```
  564a6cc8
- Fix one bare except: clause. · 7e473800
  Fred Drake authored May 11, 2001
  
  7e473800
- [].index() raises ValueError if the value is not in the list, so only · ef4cdad0
  Fred Drake authored May 11, 2001
```
catch that instead of using a bare except clause.
```
  ef4cdad0
- [].index() raises ValueError if the value is not in the list, so only · 7def2564
  Fred Drake authored May 11, 2001
```
catch that instead of using a bare except clause.
```
  7def2564
- Only catch NameError and TypeError when attempting to subclass an · a2133339
  Fred Drake authored May 11, 2001
```
exception (for compatibility with old versions of Python).
```
  a2133339
- Remove a bare try/except completely -- it just did not make sense! · 6f6a14f8
  Fred Drake authored May 11, 2001
```
Add a comment elsewhere making clear an assumption in the code.
```
  6f6a14f8
- When guarding an import, only catch ImportError. · e8187615
  Fred Drake authored May 11, 2001
  
  e8187615
- Clean up a bare except where we only expect to catch pcre.error. · 31e18291
  Fred Drake authored May 11, 2001
  
  31e18291
- Clean up bare except where only IOError makes sense. · 65255319
  Fred Drake authored May 11, 2001
  
  65255319
- Clean up bare except: when determining whether a file is seekable. · 553f6811
  Fred Drake authored May 11, 2001
  
  553f6811
- Opening a file for reading can raise IOError, so only catch that. · 776d39e2
  Fred Drake authored May 11, 2001
  
  776d39e2
- int() of a string is only expected to through ValueError, so do not use · 1b7e0795
  Fred Drake authored May 11, 2001
```
a bare except clause.
```
  1b7e0795
- <socket>.getsockopt() and <socket>.setsockopt() can only raise socket.error, · 9f9b593f
  Fred Drake authored May 11, 2001
```
so only catch that specific exception.
```
  9f9b593f
- Catch only the relevant exceptions instead of using a bare except clause. · c79f3d0d
  Fred Drake authored May 11, 2001
  
  c79f3d0d
- Add some text to make the dircmp object section more readable, and move · ba7e2c8a
  Fred Drake authored May 11, 2001
```
some stuff around.
```
  ba7e2c8a
- Include sys/modem.h if we have it; this is needed on HP-UX to provide · 87068f1e
  Fred Drake authored May 11, 2001
```
constants used by other macros from the headers.

Conditionalize VREPRINT and VDISCARD; these are not available on HP-UX.

This closes bug #417418.
```
  87068f1e
- the usual... · bbc3c5aa
  Fred Drake authored May 11, 2001
  
  bbc3c5aa
- Add a check for sys/modem.h, needed by termios on HP-UX. · 2ca5f3b7
  Fred Drake authored May 11, 2001
  
  2ca5f3b7
- --sigh-- · 886f113b
  Fred Drake authored May 11, 2001
```
Finish the last set of changes to these files so the conversion does not
break.
```
  886f113b
- Markup adjustments to avoid getting junk in the index. · 47852467
  Fred Drake authored May 11, 2001
  
  47852467
- Variant of SF patch 423181 · 1b0feb4a
  Jeremy Hylton authored May 11, 2001
```
For rich comparisons, use instance_getattr2() when possible to avoid
the expense of setting an AttributeError.  Also intern the name_op[]
table and use the interned strings rather than creating a new string
and interning it each time through.
```
  1b0feb4a
- unlink() would normally be found in the "os" module, so use it from there. · 6278799f
  Fred Drake authored May 11, 2001
```
Remove unused import of "sys".

If the file TESTFN exists before we start, try to remove it.

Add spaces around the = in some assignments.
```
  6278799f
- Cosmetic: code under "else" clause was missing indent. · 5acbfcc1
  Tim Peters authored May 11, 2001
  
  5acbfcc1
- Replace "\begin{classdesc}{SomeClass}{\unspecified}" with · 96d7a706
  Fred Drake authored May 11, 2001
```
"\begin{classdesc*}{SomeClass}" -- the rendering of \unspecified was
identical to \moreargs, so this helps clarify things just a little.
```
  96d7a706
- Document the new classdesc* environment, and the previously undocumented · 9f2376de
  Fred Drake authored May 11, 2001
```
excclassdesc environment.
```
  9f2376de
- Define a new environment, classdesc*, which can be used to document a · 06a01e84
  Fred Drake authored May 11, 2001
```
class without providing any information about the constructor.  This
should be used for classes which only exist to act as containers rather
than as factories for instances.
```
  06a01e84
10 May, 2001 5 commits

Write a better synopsis for the Scrap module, and provide a link to · 986badae
Fred Drake authored May 10, 2001
```
useful documentation on the Scrap Manager.
```
986badae
Actually include a synopsis line for the ColorPicker module. · 0eeca634
Fred Drake authored May 10, 2001

0eeca634

Restore dicts' tp_compare slot, and change dict_richcompare to say it · 4fa58bfa

Tim Peters authored May 10, 2001

doesn't know how to do LE, LT, GE, GT. dict_richcompare can't do the
latter any faster than dict_compare can. More importantly, for
cmp(dict1, dict2), Python *first* tries rich compares with EQ, LT, and
GT one at a time, even if the tp_compare slot is defined, and
dict_richcompare called dict_compare for the latter two because
it couldn't do them itself. The result was a lot of wasted calls to
dict_compare. Now dict_richcompare gives up at once the times Python
calls it with LT and GT from try_rich_to_3way_compare(), and dict_compare
is called only once (when Python gets around to trying the tp_compare
slot).
Continued mystery: despite that this cut the number of calls to
dict_compare approximately in half in test_mutants.py, the latter still
runs amazingly slowly. Running under the debugger doesn't show excessive
activity in the dict comparison code anymore, so I'm guessing the culprit
is somewhere else -- but where? Perhaps in the element (key/value)
comparison code? We clearly spend a lot of time figuring out how to
compare things.

4fa58bfa

Make test_mutants stronger by also adding random keys during comparisons. · 4c02fecf

Tim Peters authored May 10, 2001

A Mystery: test_mutants ran amazingly slowly even before dictobject.c
"got fixed". I don't have a clue as to why. dict comparison was and
remains linear-time in the size of the dicts, and test_mutants only tries
100 dict pairs, of size averaging just 50. So "it should" run in less than
an eyeblink; but it takes at least a second on this 800MHz box.

4c02fecf

Change test_mmap.py to use test_support.TESTFN instead of hardcoded "foo", · fd69208b
Tim Peters authored May 10, 2001
```
and wrap the body in try/finally to ensure TESTFN gets cleaned up no
matter what.
```
fd69208b