1. 12 Aug, 2002 20 commits
    • k_mul(): Heh -- I checked in two fixes for the last problem.  Only keep · 8c914587
      Tim Peters authored
      the good one <wink>.  Also checked in a test-aid by mistake.
    • k_mul(): White-box testing turned up that (ah+al)*(bh+bl) can, in rare · 5a8539f5
      Tim Peters authored
      cases, overflow the allocated result object by 1 bit.  In such cases,
      it would have been brought back into range if we subtracted al*bl and
      ah*bh from it first, but I don't want to do that because it hurts cache
      behavior.  Instead we just ignore the excess bit when it appears -- in
      effect, this is forcing unsigned mod BASE**(asize + bsize) arithmetic
      in a case where that doesn't happen all by itself.
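The arithmetic being described can be sketched with Python ints (a hypothetical model; the real k_mul works on vectors of 15-bit C digits, and `mul_mod_check` is an illustrative name, not a CPython function):

```python
BASE = 2 ** 15   # CPython's digit base at the time

def mul_mod_check(a, b, asize, bsize):
    # Multiply via the Karatsuba identity, reducing mod BASE**(asize+bsize)
    # so that a carry out of the top digit is simply dropped -- the
    # "ignore the excess bit" trick from the commit.
    mod = BASE ** (asize + bsize)
    shift = asize // 2
    ah, al = divmod(a, BASE ** shift)
    bh, bl = divmod(b, BASE ** shift)
    t1 = ah * bh
    t2 = al * bl
    t3 = (ah + al) * (bh + bl)        # this sum can run 1 bit "too wide"
    result = (t1 * BASE ** (2 * shift)
              + (t3 - t1 - t2) * BASE ** shift
              + t2) % mod             # the final value always fits, so the
    return result                     # wrap-around is harmless

a = b = BASE ** 4 - 1                 # worst case: all digits maximal
assert mul_mod_check(a, b, 4, 4) == a * b
```

Since a < BASE**asize and b < BASE**bsize, the true product always fits in asize + bsize digits; only the intermediate t3 term can momentarily exceed that width.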
    • Fix MSVC warnings. · 8b54f1ee
      Guido van Rossum authored
    • Refactor how __dict__ and __weakref__ interact with __slots__. · ab5e2d06
      Guido van Rossum authored
      1. You can now have __dict__ and/or __weakref__ in your __slots__
         (before only __weakref__ was supported).  This is treated
         differently than before: it merely sets a flag that the object
         should support the corresponding magic.
      
      2. Dynamic types now always have descriptors __dict__ and __weakref__
         thrust upon them.  If the type in fact does not support one or the
         other, that descriptor's __get__ method will raise AttributeError.
      
      3. (This is the reason for all this; it fixes SF bug 575229, reported
         by Cesar Douady.)  Given this code:
            class A(object): __slots__ = []
            class B(object): pass
            class C(A, B): __slots__ = []
         the class object for C was broken; its size was less than that of
         B, and some descriptors on B could cause a segfault.  C now
          correctly inherits __weakref__ and __dict__ from B, even though A
         is the "primary" base (C.__base__ is A).
      
      4. Some code cleanup, and a few comments added.
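Run against a current CPython, the behavior described in points 1 and 3 can be seen directly (class names here are illustrative):

```python
import weakref

# Point 1: __dict__ and __weakref__ may appear in __slots__; each
# merely flags that instances should support the corresponding magic.
class Slotted(object):
    __slots__ = ['x', '__dict__', '__weakref__']

s = Slotted()
s.x = 1                      # stored in the 'x' slot
s.y = 2                      # spills into the per-instance __dict__
assert s.__dict__ == {'y': 2}
assert weakref.ref(s)() is s

# Point 3: the SF bug 575229 shape -- C must inherit __dict__ and
# __weakref__ from B even though A is the "primary" base.
class A(object): __slots__ = []
class B(object): pass
class C(A, B): __slots__ = []

c = C()
c.attr = 'ok'                # works: __dict__ comes from B, no segfault
assert weakref.ref(c)() is c
```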
    • x_mul(): Made life easier for C optimizers in the "grade school" · 523a2539
      Tim Peters authored
      algorithm.  MSVC 6 wasn't impressed <wink>.
      
      Something odd:  the x_mul algorithm appears to get substantially worse
      than quadratic time as the inputs grow larger:
      
      bits in each input   x_mul time   k_mul time
      ------------------   ----------   ----------
                   15360         0.01         0.00
                   30720         0.04         0.01
                   61440         0.16         0.04
                  122880         0.64         0.14
                  245760         2.56         0.40
                  491520        10.76         1.23
                  983040        71.28         3.69
                 1966080       459.31        11.07
      
      That is, x_mul is perfectly quadratic-time until a little burp at
      2.56->10.76, and after that goes to hell in a hurry.  Under Karatsuba,
      doubling the input size "should take" 3 times longer instead of 4, and
       that remains the case throughout this range.  I conclude that my "be nice
       to the cache" reworkings of k_mul() are paying off.
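The conclusion can be checked against the table itself: per input doubling, a quadratic algorithm should cost about 4x, while Karatsuba's O(n**log2(3)) should cost about 3x:

```python
# Times copied from the table above (seconds).
x_mul = [0.01, 0.04, 0.16, 0.64, 2.56, 10.76, 71.28, 459.31]
k_mul = [0.01, 0.04, 0.14, 0.40, 1.23, 3.69, 11.07]   # 0.00 row omitted

x_ratios = [b / a for a, b in zip(x_mul, x_mul[1:])]
k_ratios = [b / a for a, b in zip(k_mul, k_mul[1:])]

# x_mul is cleanly quadratic (ratio ~4) up to the 2.56 -> 10.76 burp...
assert all(abs(r - 4.0) < 0.25 for r in x_ratios[:4])
# ...and after that goes well beyond quadratic (cache effects).
assert min(x_ratios[5:]) > 6.0
# k_mul stays near the theoretical 3x per doubling throughout
# (the first ratio is noisy: 0.01 s is at timer granularity).
assert all(2.5 < r < 3.6 for r in k_ratios[1:])
```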
    • k_mul() and long_mul(): I'm confident that the Karatsuba algorithm is · d880fd68
      Tim Peters authored
      correct now, so added some final comments, did some cleanup, and enabled
      it for all long-int multiplies.  The KARAT envar no longer matters,
      although I left some #if 0'ed code in there for my own use (temporary).
      k_mul() is still much slower than x_mul() if the inputs have very
       different sizes, and that still needs to be addressed.
    • k_mul: Rearranged computation for better cache use. Ignored overflow · d177ece2
      Tim Peters authored
      (it's possible, but should be harmless -- this requires more thought,
      and allocating enough space in advance to prevent it requires exactly
      as much thought, to know exactly how much that is -- the end result
      certainly fits in the allocated space -- hmm, but that's really all
      the thought it needs!  borrows/carries out of the high digits really
      are harmless).
    • 73f4c9b4
      Mark Hammond authored
    • Add name mangling for new PyUnicode_FromOrdinal() and fix declaration · 9171bbf6
      Marc-André Lemburg authored
      to use new extern macro.
    • Excise DL_EXPORT from Include. · 81ef4c69
      Mark Hammond authored
      Thanks to Skip Montanaro and Kalle Svensson for the patches.
    • x_mul(): This failed to normalize its result. · ad331860
      Tim Peters authored
      k_mul():  This didn't allocate enough result space when one input had
      more than twice as many bits as the other.  This was partly hidden by
      that x_mul() didn't normalize its result.
      
      The Karatsuba recurrence is pretty much hosed if the inputs aren't
      roughly the same size.  If one has at least twice as many bits as the
      other, we get a degenerate case where the "high half" of the smaller
       input is 0.  Added a special case for that, for speed; while that helped,
       this can still be much slower than the "grade school" method.
       It seems to take a really wild imbalance to trigger that; e.g., a
       2**22-bit input times a 1000-bit input on my box runs about twice as
       slowly under k_mul as under x_mul.  This still needs to be addressed.
      
      I'm also not sure that allocating a->ob_size + b->ob_size digits is
      enough, given that this is computing k = (ah+al)*(bh+bl) instead of
      k = (ah-al)*(bl-bh); i.e., it's certainly enough for the final result,
      but it's vaguely possible that adding in the "artificially" large k may
      overflow that temporarily.  If so, an assert will trigger in the debug
      build, but we'll probably compute the right result anyway(!).
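The recurrence these commits describe, including the (ah+al)*(bh+bl) variant of k and the degenerate unbalanced case, can be sketched with Python ints (the cutoff and splitting point are illustrative; the real k_mul works on 15-bit digit vectors in C):

```python
def karatsuba(a, b, cutoff=64):
    # Sketch of the Karatsuba recurrence over nonnegative Python ints.
    if a.bit_length() <= cutoff or b.bit_length() <= cutoff:
        return a * b                        # "grade school" base case
    shift = max(a.bit_length(), b.bit_length()) // 2
    ah, al = a >> shift, a & ((1 << shift) - 1)
    bh, bl = b >> shift, b & ((1 << shift) - 1)
    # Degenerate case from the commit: if b has at most `shift` bits,
    # bh == 0, so t1 is 0 and t3 collapses to (ah+al)*bl -- the recursion
    # does little useful work, which is why a special case pays off.
    t1 = karatsuba(ah, bh)
    t2 = karatsuba(al, bl)
    t3 = karatsuba(ah + al, bh + bl)        # the "artificially" large k
    return (t1 << (2 * shift)) + ((t3 - t1 - t2) << shift) + t2

# A wildly unbalanced pair like the commit's example still multiplies
# correctly -- just not quickly.
a, b = 2 ** 4096 - 1, 2 ** 1000 - 3
assert karatsuba(a, b) == a * b
```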
    • Introduced helper functions v_iadd and v_isub, for in-place digit-vector · fa09d4c8
      Tim Peters authored
      addition and subtraction.  Reworked the tail end of k_mul() to use them.
      This saves oodles of one-shot longobject allocations (this is a triply-
      recursive routine, so saving one allocation in the body saves 3**n
      allocations at depth n; we actually save 2 allocations in the body).
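A Python model of what such a helper does (the real v_iadd/v_isub are C functions over 15-bit digit arrays; this list-based version is only a sketch):

```python
BASE = 2 ** 15          # CPython's digit base at the time
MASK = BASE - 1

def v_iadd(x, m, y, n):
    """Add the n-digit vector y into the first m digits of x, in place
    (digit 0 is least significant); return the carry out of digit m-1.
    No temporary long object is allocated -- the point of the helper."""
    assert m >= n
    carry = 0
    for i in range(n):
        carry += x[i] + y[i]
        x[i] = carry & MASK
        carry >>= 15
    for i in range(n, m):       # propagate any carry through the rest of x
        if not carry:
            break
        carry += x[i]
        x[i] = carry & MASK
        carry >>= 15
    return carry

x = [MASK, MASK, 0]             # the 3-digit value BASE**2 - 1
assert v_iadd(x, 3, [1], 1) == 0 and x == [0, 0, 1]
```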
    • New news about __class__ assignment restrictions and speed-up of · a9148c34
      Guido van Rossum authored
      new-style object creation/deallocation.
      
       Moved all news about type/class unification and new-style classes to a
      separate section at the top.
    • bd2299b2
      Neal Norwitz authored
    • 632c5009
      Tim Peters authored
    • k_mul(): Repaired typo in comment. · 9c9dbf0a
      Tim Peters authored
    • Cautious introduction of a patch that started from · a0f082fa
      Tim Peters authored
      SF 560379:  Karatsuba multiplication.
      Lots of things were changed from that.  This needs a lot more testing,
      for correctness and speed, the latter especially when bit lengths are
      unbalanced.  For now, the Karatsuba code gets invoked if and only if
      envar KARAT exists.
  2. 11 Aug, 2002 8 commits
  3. 10 Aug, 2002 10 commits
  4. 09 Aug, 2002 2 commits