+++++++++++
Python News
+++++++++++

(editors: check NEWS.help for information about editing NEWS using ReST.)

What's New in Python 2.5 alpha 1?
=================================

*Release date: XX-XXX-2006*

Core and builtins
-----------------

- Previously, Python code had no easy way to access the contents of a
  cell object. Now, a ``cell_contents`` attribute has been added
  (closes patch #1170323).
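
A minimal sketch of reading a cell through the new attribute. It is written for a current Python 3 interpreter, where a function's cells are reached through ``__closure__``; the function names are made up for illustration.

```python
def make_counter(start):
    # 'start' is a free variable of current(), so it lives in a cell object.
    def current():
        return start
    return current

counter = make_counter(10)

# Each free variable has one cell; cell_contents exposes the stored value.
cell = counter.__closure__[0]
captured = cell.cell_contents
```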

- Patch #1123430: Python's small-object allocator now returns an arena to
  the system ``free()`` when all memory within an arena becomes unused
  again.  Prior to Python 2.5, arenas (256KB chunks of memory) were never
  freed.  Some applications will see a drop in virtual memory size now,
  especially long-running applications that, from time to time, temporarily
  use a large number of small objects.  Note that when Python returns an
  arena to the platform C's ``free()``, there's no guarantee that the
  platform C will in turn return that memory to the operating system.  The
  effect of the patch is to stop making that impossible, and in tests it
  appears to be effective at least on Microsoft C and gcc-based systems.
  Thanks to Evan Jones for hard work and patience.

- Patch #1434038: property() now uses the getter's docstring if there is
  no "doc" argument given. This makes it possible to legitimately use
  property() as a decorator to produce a read-only property.
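
As an illustration, a read-only property written with the decorator form might look like this. The ``Circle`` class is hypothetical and only serves the example.

```python
class Circle(object):
    def __init__(self, radius):
        self._radius = radius

    @property
    def radius(self):
        """The circle's radius (read-only)."""
        return self._radius

c = Circle(2.0)

# The property picks up the getter's docstring because no doc argument was given.
doc = Circle.radius.__doc__

# No setter was defined, so assignment is rejected.
try:
    c.radius = 3.0
    assigned = True
except AttributeError:
    assigned = False
```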

- PEP 357, patch 1436368: add an __index__ method to int/long and a matching
  nb_index slot to the PyNumberMethods struct.  The slot is consulted instead
  of requiring an int or long in slicing and a few other contexts, enabling
  other objects (e.g. Numeric Python's integers) to be used as slice indices.
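
A small sketch of an object that can be used as a slice index once it defines ``__index__``. The ``Offset`` class is a hypothetical stand-in for an integer-like type from another library.

```python
class Offset(object):
    """Hypothetical integer-like wrapper, not part of any library."""

    def __init__(self, value):
        self.value = value

    def __index__(self):
        # Consulted wherever a sequence needs a true integer index.
        return self.value

letters = ['a', 'b', 'c', 'd', 'e']
picked = letters[Offset(1):Offset(4)]
single = letters[Offset(2)]
```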

- Fixed various bugs reported by Coverity's Prevent tool.

- PEP 352, patch #1104669: Make exceptions new-style objects.  Introduced the
  new exception base class, BaseException, which has a new message attribute.
  KeyboardInterrupt and SystemExit now inherit directly from BaseException.
  Raising a string exception now raises a DeprecationWarning.
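
The practical effect of the new hierarchy can be sketched as follows; ``classify`` is an illustrative helper, not part of the patch.

```python
def classify(exc):
    # Ordinary errors derive from Exception; KeyboardInterrupt and
    # SystemExit derive from BaseException only.
    if isinstance(exc, Exception):
        return "error"
    return "exit-or-interrupt"

results = [classify(e) for e in (ValueError("bad"),
                                 KeyboardInterrupt(),
                                 SystemExit(0))]
```

A handler written as ``except Exception:`` therefore no longer swallows an interrupt or an exit request.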

- Patch #1438387, PEP 328: relative and absolute imports. Imports can now be
  explicitly relative, using 'from .module import name' to mean 'from the same
  package as this module is in'. Imports without dots still default to the
  old relative-then-absolute behavior, unless 'from __future__ import
  absolute_import' is used.

- Properly check if 'warnings' raises an exception (usually when a filter set
  to "error" is triggered) when raising a warning for raising string
  exceptions.

- CO_GENERATOR_ALLOWED is no longer defined; this behavior is the default.
  The name was removed from Include/code.h.

- PEP 308: conditional expressions were added (x if cond else y).
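
A short sketch of the new expression form; ``safe_ratio`` is a made-up helper.

```python
def safe_ratio(a, b):
    # Only the selected branch is evaluated, so a / b never runs when b == 0.
    return a / b if b != 0 else None

labels = ["even" if n % 2 == 0 else "odd" for n in range(4)]
ratio = safe_ratio(6, 3)
guarded = safe_ratio(6, 0)
```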

- Patch 1433928:
  - The copy module now "copies" function objects (as atomic objects).
  - dict.__getitem__ now looks for a __missing__ hook before raising
    KeyError.
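
A minimal sketch of the ``__missing__`` hook on a dict subclass. ``ZeroDict`` is a hypothetical name chosen for the example.

```python
class ZeroDict(dict):
    """Hypothetical dict subclass that treats absent keys as zero."""

    def __missing__(self, key):
        # Called by dict.__getitem__ instead of raising KeyError.
        # The value is returned but not stored.
        return 0

counts = ZeroDict()
for word in "a b a".split():
    counts[word] = counts[word] + 1

absent = counts["zzz"]
```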

- PEP 343: with statement implemented. Needs ``from __future__ import
  with_statement``. Use of 'with' as a variable will generate a warning.
  Use of 'as' as a variable will also generate a warning (unless it's
  part of an import statement).
  The following objects have __context__ methods:
  - The built-in file type.
  - The thread.LockType type.
  - The following types defined by the threading module:
    Lock, RLock, Condition, Semaphore, BoundedSemaphore.
  - The decimal.Context class.

- Fix the encodings package codec search function to only search
  inside its own package. Fixes problem reported in patch #1433198.

  Note: Codec packages should implement and register their own
  codec search function. PEP 100 has the details.

- PEP 353: Using ssize_t as the index type.

- Patch #1400181, fix unicode string formatting to not use the locale.
  This matches how string objects work.  Previously, u'%f' could use ',' instead of '.'
  for the decimal point.  Now both strings and unicode always use periods.

- Bug #1244610, #1392915, fix build problem on OpenBSD 3.7 and 3.8.
  configure would break checking curses.h.

- Bug #959576: The pwd module is now builtin. This allows Python to be
  built on UNIX platforms without $HOME set.

- Bug #1072182, fix some potential problems if characters are signed.

- Bug #889500, fix line number on SyntaxWarning for global declarations.

- Bug #1378022, UTF-8 files with a leading BOM crashed the interpreter.

- Converting hex strings to floats is no longer supported.
  This was not portable.  float('0x3') now raises a ValueError.

- Patch #1382163: Expose Subversion revision number to Python.  New C API
  function Py_GetBuildNumber().  New attribute sys.subversion.  Build number
  is now displayed in interactive prompt banner.

- Implementation of PEP 341 - Unification of try/except and try/finally.
  "except" clauses can now be written together with a "finally" clause in
  one try statement instead of two nested ones.  Patch #1355913.
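
The combined form can be sketched like this; ``parse`` and ``events`` are illustrative names.

```python
events = []

def parse(text):
    # One try statement now carries both the handler and the cleanup.
    try:
        return int(text)
    except ValueError:
        events.append("invalid")
        return None
    finally:
        events.append("done")

good = parse("42")
bad = parse("forty-two")
```

The ``finally`` block runs on both paths, after the ``except`` clause when one is triggered.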

- Bug #1379994: The builtin unicode_escape and raw_unicode_escape codecs
  now encode backslashes correctly.

- Patch #1350409: Work around signal handling bug in Visual Studio 2005.

- Bug #1281408: Py_BuildValue now works correctly even with unsigned longs
  and long longs.

- SF Bug #1350188, "setdlopenflags" leads to crash upon "import".
  dlerror() could return a NULL pointer; a default error message is
  now used in this case.

- Replaced most Unicode charmap codecs with new ones using the
  new Unicode translate string feature in the builtin charmap
  codec; the codecs were created from the mapping tables available
  at ftp.unicode.org and contain a few updates (e.g. the Mac OS
  encodings now include a mapping for the Apple logo)

- Added a few more codecs for Mac OS encodings

- Speed up some Unicode operations.

- A new AST parser implementation was completed. The abstract
  syntax tree is available for read-only (non-compile) access
  to Python code; an _ast module was added.

- SF bug #1167751: fix incorrect code being generated for generator expressions.
  The following code now raises a SyntaxError:  foo(a = i for i in range(10))

- SF Bug #976608: fix SystemError when mtime of an imported file is -1.

- SF Bug #887946: fix segfault when redirecting stdin from a directory.
  Provide a warning when a directory is passed on the command line.

- Fix segfault with invalid coding.

- SF bug #772896: unknown encoding results in MemoryError.

- All iterators now have a Boolean value of true.  Formerly, some iterators
  supported a __len__() method which evaluated to False when the iterator
  was empty.

- On 64-bit platforms, when __len__() returns a value that cannot be
  represented as a C int, raise OverflowError.

- test__locale is skipped on OS X < 10.4 (only partial locale support is
  present).

- SF bug #893549: parsing keyword arguments was broken with a few format
  codes.

- Changes donated by Elemental Security to make Python work on AIX 5.3
  with IBM's 64-bit compiler (SF patch #1284289).  This also closes SF
  bug #105470: test_pwd fails on 64bit system (Opteron).

- Changes donated by Elemental Security to make Python work on HP-UX 11 on
  Itanium2 with HP's 64-bit compiler (SF patch #1225212).

- Disallow keyword arguments for type constructors that don't use them
  (fixes bug #1119418).

- Forward UnicodeDecodeError into SyntaxError for source encoding errors.

- SF bug #900092: When tracing (e.g. for hotshot), restore 'return' events for
  exceptions that cause a function to exit.

- The implementation of set() and frozenset() was revised to use its
  own internal data structure.  Memory consumption is reduced by 1/3
  and there are modest speed-ups as well.  The API is unchanged.

- SF bug #1238681:  freed pointer is used in longobject.c:long_pow().

- SF bug #1229429: PyObject_CallMethod failed to decrement some
  reference counts in some error exit cases.

- SF bug #1185883:  Python's small-object memory allocator took over
  a block managed by the platform C library whenever a realloc specified
  a small new size.  However, there's no portable way to know then how
  much of the address space following the pointer is valid, so no
  portable way to copy data from the C-managed block into Python's
  small-object space without risking a memory fault.  Python's small-object
  realloc now leaves such blocks under the control of the platform C
  realloc.

- SF bug #1232517: An overflow error was not detected properly when
  attempting to convert a large float to an int in os.utime().

- SF bug #1224347: hex longs now print with lowercase letters just
  like their int counterparts.

- SF bug #1163563: the original fix for bug #1010677 ("thread Module
  Breaks PyGILState_Ensure()") broke badly in the case of multiple
  interpreter states; back out that fix and do a better job (see
  http://mail.python.org/pipermail/python-dev/2005-June/054258.html
  for a longer write-up of the problem).

- SF patch #1180995: marshal now uses a binary format by default when
  serializing floats.

- SF patch #1181301: on platforms that appear to use IEEE 754 floats,
  the routines that promise to produce IEEE 754 binary representations
  of floats now simply copy bytes around.

- bug #967182: disallow opening files with 'wU' or 'aU' as specified by PEP
  278.

- patch #1109424: int, long, float, complex, and unicode now check for the
  proper magic slot for type conversions when subclassed.  Previously the
  magic slot was ignored during conversion.  Semantics now match the way
  subclasses of str always behaved.  For int/long/float, conversion of an
  instance to the base class has been moved to the proper nb_* magic slot
  and out of PyNumber_*().
  Thanks Walter Dörwald.

- Descriptors defined in C with a PyGetSetDef structure, where the setter is
  NULL, now raise an AttributeError when attempting to set or delete the
  attribute.  Previously a TypeError was raised, but this was inconsistent
  with the equivalent pure-Python implementation.

- It is now safe to call PyGILState_Release() before
  PyEval_InitThreads() (note that if there is reason to believe there
  are multiple threads around you still must call PyEval_InitThreads()
  before using the Python API; this fix is for extension modules that
  have no way of knowing if Python is multi-threaded yet).

- Typing Ctrl-C whilst raw_input() was waiting in a build with threads
  disabled caused a crash.

- Bug #1165306: instancemethod_new allowed the creation of a method
  with im_class == im_self == NULL, which caused a crash when called.

- Move exception finalisation later in the shutdown process - this
  fixes the crash seen in bug #1165761

- Added two new builtins, any() and all().
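
A brief sketch of the two builtins over a generator expression; the sample data is invented.

```python
sizes = [3, 0, 7]

# any() is true as soon as one element is true; all() needs every element.
has_empty = any(s == 0 for s in sizes)
all_positive = all(s > 0 for s in sizes)

# On an empty iterable, any() is False and all() is True.
empty_any = any([])
empty_all = all([])
```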

- Defining a class with empty parentheses is now allowed
  (e.g., ``class C(): pass`` is no longer a syntax error).
  Patch #1176012 added support to the 'parser' module and 'compiler' package
  (thanks to logistix for that added support).

- Patch #1115086: Support PY_LONGLONG in structmember.

- Bug #1155938: new style classes did not check that __init__() was
  returning None.

- Patch #802188: Report characters after line continuation character
  ('\') with a specific error message.

- Bug #723201: Raise a TypeError for passing bad objects to 'L' format.

- Bug #1124295: the __name__ attribute of file objects was
  inadvertently made inaccessible in restricted mode.

- Bug #1074011: closing sys.std{out,err} now causes a flush() and
  an ferror() call.

- min() and max() now support key= arguments with the same meaning as in
  list.sort().
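
For instance, selecting by length rather than by alphabetical order (the word list is invented):

```python
words = ["pear", "fig", "banana"]

# key= is applied to each element and the comparison uses the results.
shortest = min(words, key=len)
longest = max(words, key=len)
```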

- The peephole optimizer now performs simple constant folding in expressions:
      (2+3) --> (5).

- set and frozenset objects can now be marshalled.  SF #1098985.

- Bug #1077106: Poor argument checking could cause memory corruption
  in calls to os.read().

- The parser did not complain about future statements in illegal
  positions.  It once again reports a syntax error if a future
  statement occurs after anything other than a doc string.

- Change the %s format specifier for str objects so that it returns a
  unicode instance if the argument is not an instance of basestring and
  calling __str__ on the argument returns a unicode instance.

- Patch #1413181:  changed ``PyThreadState_Delete()`` to forget about the
  current thread state when the auto-GIL-state machinery knows about
  it (since the thread state is being deleted, continuing to remember it
  can't help, but can hurt if another thread happens to get created with
  the same thread id).

Extension Modules
-----------------

- Patch #1231053: The audioop module now supports encoding/decoding of alaw.
  In addition, the existing ulaw code was updated.

- RFE #567972: Socket objects' family, type and proto properties are
  now exposed via new get...() methods.

- Everything under lib-old was removed.  This includes the following modules:
    Para, addpack, cmp, cmpcache, codehack, dircmp, dump, find, fmt, grep,
    lockfile, newdir, ni, packmail, poly, rand, statcache, tb, tzparse, 
    util, whatsound, whrandom, zmod

- The following modules were removed:  regsub, reconvert, regex, regex_syntax.

- re and sre were swapped, so help(re) provides full help.  Importing sre
  is deprecated.  The undocumented re.engine variable no longer exists.

- Bug #1448490: Fixed a bug where ISO-2022 codecs could not handle
  SS2 (single-shift 2) escape sequences correctly.

- The unicodedata module was updated to the 4.1 version of the Unicode
  database. The 3.2 version is still available as unicodedata.db_3_2_0
  for applications that require this specific version (such as IDNA).

- The timing module is no longer built by default.  It was deprecated
  in PEP 4 in Python 2.0 or earlier.

- Patch 1433928: Added a new type, defaultdict, to the collections module.
  This uses the new __missing__ hook behavior added to dict (see above).
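
A minimal sketch of grouping with the new type; the sample names are invented.

```python
from collections import defaultdict

# list is the default factory: a missing key gets a fresh empty list,
# which is stored in the dictionary on first access.
groups = defaultdict(list)
for name in ["apple", "avocado", "banana"]:
    groups[name[0]].append(name)

grouped = dict(groups)
```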

- Bug #854823: socketmodule now builds on Sun platforms even when
  INET_ADDRSTRLEN is not defined.

- Patch #1393157: os.startfile() now has an optional argument to specify
  a "command verb" to invoke on the file.

- Bug #876637, prevent stack corruption when socket descriptor
  is larger than FD_SETSIZE.

- Patch #1407135, bug #1424041: harmonize mmap behavior of anonymous memory.
  mmap.mmap(-1, size) now returns anonymous memory in both Unix and Windows.
  mmap.mmap(0, size) should not be used on Windows for anonymous memory.
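
A short sketch of the portable spelling for anonymous memory, written for a current Python 3 interpreter (hence the bytes literals):

```python
import mmap

# File descriptor -1 requests anonymous memory not backed by any file.
buf = mmap.mmap(-1, 16)
buf[:5] = b"hello"
head = buf[:5]
size = len(buf)
buf.close()
```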

- Patch #1422385: The nis module now supports access to domains other
  than the system default domain.

- Use Win32 API to implement os.stat/fstat. As a result, subsecond timestamps
  are reported, the limit on path name lengths is removed, and stat reports
  WindowsError now (instead of OSError).

- Add bsddb.db.DBEnv.set_tx_timestamp allowing time based database recovery.

- Bug #1413192, fix seg fault in bsddb if a transaction was deleted
  before the env.

- Patch #1103116: Basic AF_NETLINK support.

- Bug #1402308, (possible) segfault when using mmap.mmap(-1, ...)

- Bug #1400822, _curses over{lay,write} doesn't work when passing 6 ints.
  Also fix ungetmouse() which did not accept arguments properly.
  The code now conforms to the documented signature.

- Bug #1400115, Fix segfault when calling curses.panel.userptr()
  without prior setting of the userptr.

- Fix 64-bit problems in bsddb.

- Patch #1365916: fix some unsafe 64-bit mmap methods.

- Bug #1290333: Added a workaround for cjkcodecs' _codecs_cn build
  problem on AIX.

- Bug #869197: os.setgroups rejects long integer arguments

- Bug #1346533, select.poll() doesn't raise an error if timeout > sys.maxint

- Bug #1344508, Fix UNIX mmap leaking file descriptors

- Patch #1338314, Bug #1336623: fix tarfile so it can extract
  REGTYPE directories from tarfiles written by old programs.

- Patch #1407992, fixes broken bsddb module db associate when using
  BerkeleyDB 3.3, 4.0 or 4.1.

- Get bsddb module to build with BerkeleyDB version 4.4

- Get bsddb module to build with BerkeleyDB version 3.2

- Patch #1309009, Fix segfault in pyexpat when the XML document is in latin_1,
  but Python incorrectly assumes it is in UTF-8 format

- Fix parse errors in the readline module when compiling without threads.

- Patch #1288833: Removed thread lock from socket.getaddrinfo on
  FreeBSD 5.3 and later versions which got thread-safe getaddrinfo(3).

- Patches #1298449 and #1298499: Add some missing checks for error
  returns in cStringIO.c.

- Patch #1297028: fix segfault if call type on MultibyteCodec,
  MultibyteStreamReader, or MultibyteStreamWriter

- Fix memory leak in posix.access().

- Patch #1213831: Fix typo in unicodedata._getcode.

- Bug #1007046: os.startfile() did not accept unicode strings encoded in
  the file system encoding.

- Patch #756021: Special-case socket.inet_aton('255.255.255.255') for
  platforms that don't have inet_aton().

- Bug #1215928: Fix bz2.BZ2File.seek() for 64-bit file offsets.

- Bug #1191043: Fix bz2.BZ2File.(x)readlines for files containing one
  line without newlines.

- Bug #728515: mmap.resize() now resizes the file on Unix as it did
  on Windows.

- Patch #1180695: Add nanosecond stat resolution, and st_gen,
  st_birthtime for FreeBSD.

- Patch #1231069: The fcntl.ioctl function now uses the 'I' code for
  the request code argument, which results in more C-like behaviour
  for large or negative values.

- Bug #1234979: For the argument of thread.Lock.acquire, the Windows
  implementation treated all integer values except 1 as false.

- Bug #1194181: bz2.BZ2File didn't handle mode 'U' correctly.

- Patch #1212117: os.stat().st_flags is now accessible as an attribute
  if available on the platform.

- Patch #1103951: Expose O_SHLOCK and O_EXLOCK in the posix module if
  available on the platform.

- Bug #1166660: The readline module could segfault if hook functions
  were set in a different thread than that which called readline.

- collections.deque objects now support a remove() method.

- operator.itemgetter() and operator.attrgetter() now support retrieving
  multiple fields.  This provides direct support for sorting on multiple
  keys (primary, secondary, etc).
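
Sorting on a primary and a secondary key can be sketched as follows; the rows are invented sample data.

```python
from operator import itemgetter

rows = [("smith", "bob", 30), ("jones", "amy", 25), ("smith", "al", 41)]

# With several indices, itemgetter returns a tuple of the selected fields,
# so the sort compares field 0 first and field 1 on ties.
by_name = sorted(rows, key=itemgetter(0, 1))
first_key = itemgetter(0, 1)(rows[0])
```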

- os.access now supports Unicode path names on non-Win32 systems.

- Patches #925152, #1118602: Avoid reading after the end of the buffer
  in pyexpat.GetInputContext.

- Patches #749830, #1144555: allow UNIX mmap size to default to current
  file size.

- Added functional.partial().  See PEP309.

- Patch #1093585: raise a ValueError for negative history items in readline.
  {remove_history,replace_history}

- The spwd module has been added, allowing access to the shadow password
  database.

- stat_float_times is now True.

- array.array objects are now picklable.

- The cPickle module no longer accepts the deprecated None option in the
  args tuple returned by __reduce__().

- itertools.islice() now accepts None for the start and step arguments.
  This allows islice() to work more readily with slices:
      islice(iterable, s.start, s.stop, s.step)
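
A small sketch of passing slice attributes straight through to islice(); ``take`` is a hypothetical helper.

```python
from itertools import islice

def take(iterable, s):
    # Unset slice fields are None, which islice() now accepts directly.
    return list(islice(iterable, s.start, s.stop, s.step))

head = take(range(10), slice(None, 4, None))
strided = take(range(10), slice(2, None, 3))
```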

- datetime.datetime now has a strptime class method which can be used to
  create a datetime object from a string and a format.
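
For example, with an invented timestamp string:

```python
from datetime import datetime

# The format codes are the same ones time.strptime() understands.
stamp = datetime.strptime("2006-03-15 14:30", "%Y-%m-%d %H:%M")
```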

Library
-------

- Patch #1436130: codecs.lookup() now returns a CodecInfo object (a subclass
  of tuple) that provides incremental decoders and encoders (a way to use
  stateful codecs without the stream API). Functions
  codecs.getincrementaldecoder() and codecs.getincrementalencoder() have
  been added.
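
A sketch of decoding input that arrives in pieces, written for a current Python 3 interpreter. The split point is chosen on purpose to fall inside a multi-byte character.

```python
import codecs

decoder = codecs.getincrementaldecoder("utf-8")()

data = u"caf\xe9".encode("utf-8")   # the last character takes two bytes

# The first chunk ends mid-character; the decoder keeps the partial byte.
first = decoder.decode(data[:4])
# final=True tells the decoder that no more input will follow.
second = decoder.decode(data[4:], final=True)
```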

- Patch #1359365: Calling next() on a closed StringIO.StringIO object raises
  a ValueError instead of a StopIteration now (like file and
  cStringIO.StringIO do).  cStringIO.StringIO.isatty() will raise a
  ValueError now if close() has been called before (like file and
  StringIO.StringIO do).

- A regrtest option -w was added to re-run failed tests in verbose mode.

- Patch #1446372: quit and exit can now be called from the interactive
  interpreter to exit.

- The function get_count() has been added to the gc module, and gc.collect()
  grew an optional 'generation' argument.

- A library msilib to generate Windows Installer files, and a distutils
  command bdist_msi have been added.

- PEP 343: new module contextlib.py defines decorator @contextmanager
  and helpful context managers nested() and closing().
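
A minimal sketch of ``@contextmanager`` and ``closing()``. On 2.5 the module would also need ``from __future__ import with_statement`` as its first statement; ``tagged`` and ``Resource`` are hypothetical names.

```python
from contextlib import contextmanager, closing

log = []

@contextmanager
def tagged(name):
    # Code before the yield runs on entry, code after it on exit.
    log.append("enter " + name)
    try:
        yield name
    finally:
        log.append("exit " + name)

class Resource(object):
    """Hypothetical object that only offers a close() method."""
    closed = False

    def close(self):
        self.closed = True

res = Resource()
with tagged("job") as label:
    # closing() calls res.close() when the block ends.
    with closing(res):
        log.append("working in " + label)
```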

- The compiler package now supports future imports after the module docstring.

- Bug #1413790: zipfile now sanitizes absolute archive names that are
  not allowed by the specs.

- Patch #1215184: FileInput now can be given an opening hook which can
  be used to control how files are opened.

- Patch #1212287: fileinput.input() now has a mode parameter for
  specifying the file mode input files should be opened with.

- Patch #1215184: fileinput now has a fileno() function for getting the
  current file number.

- Patch #1349274: gettext.install() now optionally installs additional
  translation functions other than _() in the builtin namespace.

- Patch #1337756: fileinput now accepts Unicode filenames.

- Patch #1373643: The chunk module can now read chunks larger than
  two gigabytes.

- Patch #1417555: SimpleHTTPServer now returns Last-Modified headers.

- Bug #1430298: It is now possible to send a mail with an empty
  return address using smtplib.

- Bug #1432260: The names of lambda functions are now properly displayed
  in pydoc.

- Patch #1412872: zipfile now sets the creator system to 3 (Unix)
  unless the system is Win32.

- Patch #1349118: urllib now supports user:pass@ style proxy
  specifications, raises IOErrors when proxies for unsupported protocols
  are defined, and uses the https proxy on https redirections.

- Bug #902075: urllib2 now supports 'host:port' style proxy specifications.

- Bug #1407902: Add support for sftp:// URIs to urlparse.

- Bug #1371247: Update Windows locale identifiers in locale.py.

- Bug #1394565: SimpleHTTPServer no longer chokes on query parameters.

- Bug #1403410: The warnings module no longer gets confused when it
  cannot determine the name of the module it generates a warning for.

- Patch #1177307: Added a new codec utf_8_sig for UTF-8 with a BOM signature.
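
A sketch of how the new codec differs from plain UTF-8 when a signature is present, written for a current Python 3 interpreter:

```python
import codecs

raw = codecs.BOM_UTF8 + u"hello".encode("utf-8")

# Plain UTF-8 keeps the signature as a U+FEFF character in the text.
plain = raw.decode("utf-8")
# utf_8_sig strips it on decoding and writes it on encoding.
stripped = raw.decode("utf_8_sig")
encoded = u"hi".encode("utf_8_sig")
```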

- Patch #1157027: cookielib mishandles RFC 2109 cookies in Netscape mode

- Patch #1117398: cookielib.LWPCookieJar and .MozillaCookieJar now raise
  LoadError as documented, instead of IOError.  For compatibility,
  LoadError subclasses IOError.

- Added the hashlib module.  It provides secure hash functions for MD5,
  SHA-1, SHA-224, SHA-256, SHA-384, and SHA-512.  Note that recent
  developments make the historic MD5 and SHA1 unsuitable for
  cryptographic-strength applications.
  In <http://mail.python.org/pipermail/python-dev/2005-December/058850.html>
  Ronald L. Rivest offered this advice for Python:

      "The consensus of researchers in this area (at least as
      expressed at the NIST Hash Function Workshop 10/31/05),
      is that SHA-256 is a good choice for the time being, but
      that research should continue, and other alternatives may
      arise from this research.  The larger SHA's also seem OK."
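
Following that advice, a SHA-256 digest can be computed as sketched here (bytes literals assume a current Python 3 interpreter; the input is arbitrary):

```python
import hashlib

# One-shot hashing of a complete message.
digest = hashlib.sha256(b"python").hexdigest()

# Feeding the same data in pieces yields the same digest.
h = hashlib.sha256()
h.update(b"py")
h.update(b"thon")
incremental = h.hexdigest()

# Algorithms can also be selected by name.
by_name = hashlib.new("sha256", b"python").hexdigest()
```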

- Added a subset of Fredrik Lundh's ElementTree package.  Available
  modules are xml.etree.ElementTree, xml.etree.ElementPath, and
  xml.etree.ElementInclude, from ElementTree 1.2.6.
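
A minimal sketch of parsing and querying a document with xml.etree.ElementTree; the XML snippet is invented.

```python
from xml.etree import ElementTree

root = ElementTree.fromstring(
    "<news><item id='1'>with</item><item id='2'>any</item></news>")

# findall() returns the direct children matching the tag name.
texts = [item.text for item in root.findall("item")]
ids = [item.get("id") for item in root.findall("item")]
```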

- Patch #1162825: Support non-ASCII characters in IDLE window titles.

- Bug #1365984: urllib now opens "data:" URLs again.

- Patch #1314396: prevent deadlock for threading.Thread.join() when an exception
  is raised within the method itself on a previous call (e.g., passing in an
  illegal argument)

- Bug #1340337: change time.strptime() to always raise ValueError when there
  is an error in the format string.

- Patch #754022: Greatly enhanced webbrowser.py (by Oleg Broytmann).

- Bug #729103: pydoc.py: Fix docother() method to accept additional
  "parent" argument.

- Patch #1300515: xdrlib.py: Fix pack_fstring() to really use null bytes
  for padding.

- Bug #1296004: httplib.py: Limit maximal amount of data read from the
  socket to avoid a MemoryError on Windows.

- Patch #1166948: locale.py: Prefer LC_ALL, LC_CTYPE and LANG over LANGUAGE
  to get the correct encoding.

- Patch #1166938: locale.py: Parse LANGUAGE as a colon separated list of
  languages.

- Patch #1268314: Cache lines in StreamReader.readlines for performance.

- Bug #1290505: Fix clearing the regex cache for time.strptime().

- Bug #1167128: Fix size of a symlink in a tarfile to be 0.

- Patch #810023: Fix off-by-one bug in urllib.urlretrieve reporthook
  functionality.

- Bug #1163178: Make IDNA return an empty string when the input is empty.

- Patch #848017: Make Cookie more RFC-compliant. Use CRLF as default output
  separator and do not output trailing semicolons.

- Patch #1062060: urllib.urlretrieve() now raises a new exception, named
  ContentTooShortException, when the actually downloaded size does not
  match the Content-Length header.

- Bug #1121494: distutils.dir_util.mkpath now accepts Unicode strings.

- Bug #1178484: Return complete lines from codec stream readers
  even if there is an exception in later lines, resulting in
  correct line numbers for decoding errors in source code.

- Bug #1192315: Disallow negative arguments to clear() in pdb.

- Patch #827386: Support absolute source paths in msvccompiler.py.

- Patch #1105730: Apply the new implementation of commonprefix in posixpath
  to ntpath, macpath, os2emxpath and riscospath.

- Fix a problem in Tkinter introduced by SF patch #869468: delete bogus
  __hasattr__ and __delattr__ methods on class Tk that were breaking
  Tkdnd.

- Bug #1015140: disambiguated the term "article id" in nntplib docs and
  docstrings to either "article number" or "message id".

- Bug #1238170: threading.Thread.__init__ no longer has "kwargs={}" as a
  parameter, but uses the usual "kwargs=None".

- textwrap now processes text chunks at O(n) speed instead of O(n**2).
  Patch #1209527 (Contributed by Connelly).

- urllib2 now has an attribute 'httpresponses' mapping from HTTP status code
  to W3C name (404 -> 'Not Found'). RFE #1216944.

- Bug #1177468: Don't cache the /dev/urandom file descriptor for os.urandom,
  as this can cause problems with apps closing all file descriptors.

- Bug #839151: Fix an attempt to access sys.argv in the warnings module;
  it can be missing in embedded interpreters.

- Bug #1155638: Fix a bug which affected HTTP 0.9 responses in httplib.

- Bug #1100201: Cross-site scripting was possible on BaseHTTPServer via
  error messages.

- Bug #1108948: Cookie.py produced invalid JavaScript code.

- The tokenize module now detects and reports indentation errors.
  Bug #1224621.

- The tokenize module has a new untokenize() function to support a full
  roundtrip from lexed tokens back to Python sourcecode.  In addition,
  the generate_tokens() function now accepts a callable argument that
  terminates by raising StopIteration.
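
A round trip through the tokenizer might be sketched like this, written for a current Python 3 interpreter where the readline callable comes from ``io.StringIO``:

```python
import io
import tokenize

source = "x = 1 + 2\n"

# generate_tokens() pulls lines from the callable until the input ends.
tokens = list(tokenize.generate_tokens(io.StringIO(source).readline))

# untokenize() turns the token stream back into source text.
rebuilt = tokenize.untokenize(tokens)

# The rebuilt text is still valid code with the same meaning.
namespace = {}
exec(rebuilt, namespace)
```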

672 673
- Bug #1196315: fix weakref.WeakValueDictionary constructor.

- Bug #1213894: os.path.realpath didn't resolve symlinks that were the first
  component of the path.

- Patch #1120353: The xmlrpclib module provides better, more transparent,
  support for datetime.{datetime,date,time} objects.  With use_datetime set
  to True, applications shouldn't have to fiddle with the DateTime wrapper
  class at all.

- distutils.command.upload was added to support uploading distribution
  files to PyPI.

- distutils.command.register now encodes the data as UTF-8 before posting
  it to PyPI.

- decimal operator and comparison methods now return NotImplemented
  instead of raising a TypeError when interacting with other types.  This
  allows other classes to implement __radd__ style methods and have them
  work as expected.

- Bug #1163325:  Decimal infinities failed to hash.  Attempting to
  hash a NaN raised an InvalidOperation instead of a TypeError.

- Patch #918101: Add tarfile open mode r|* for auto-detection of the
  stream compression; add, for symmetry reasons, r:* as a synonym of r.

- Patch #1043890: Add extractall method to tarfile.

- Patch #1075887: Don't require MSVC in distutils if there is nothing
  to build.

- Patch #1103407: Properly deal with tarfile iterators when untarring
  symbolic links on Windows.

- Patch #645894: Use getrusage for computing the time consumption in
  profile.py if available.

- Patch #1046831: Use get_python_version where appropriate in sysconfig.py.

- Patch #1117454: Remove code to special-case cookies without values
  in LWPCookieJar.

- Patch #1117339: Add cookielib special name tests.

- Patch #1112812: Make bsddb/__init__.py more friendly for modulefinder.

- Patch #1110248: SYNC_FLUSH the zlib buffer for GZipFile.flush.

- Patch #1107973: Allow iterating over the lines of a tarfile.ExFileObject.

- Patch #1104111: Alter setup.py --help and --help-commands.

- Patch #1121234: Properly cleanup _exit and tkerror commands.

- Patch #1049151: xdrlib now unpacks booleans as True or False.

- Fixed a NameError bug in cookielib.  Patch #1116583.

- Applied a security fix to SimpleXMLRPCServer (PSF-2005-001).  This
  disables recursive traversal through instance attributes, which can
  be exploited in various ways.

- Bug #1222790: in SimpleXMLRPCServer, set the reuse-address and close-on-exec
  flags on the HTTP listening socket.

- Bug #792570: SimpleXMLRPCServer had problems if the request grew too large.
  Fixed by reading the HTTP body in chunks instead of one big socket.read().

- Patches #893642, #1039083: add allow_none, encoding arguments to constructors of
  SimpleXMLRPCServer and CGIXMLRPCRequestHandler.

- Bug #1110478: Revert os.environ.update to do putenv again.

- Bug #1103844: fix distutils.install.dump_dirs() with negated options.

- os.{SEEK_SET, SEEK_CUR, SEEK_END} have been added for convenience.

- Enhancements to the csv module:

  + Dialects are now validated by the underlying C code, better
    reflecting its capabilities, and improving its compliance with
    PEP 305.
  + Dialect parameter parsing has been re-implemented to improve error
    reporting.
  + quotechar=None and quoting=QUOTE_NONE now work the way PEP 305
    dictates.
  + the parser now removes the escapechar prefix from escaped characters.
  + when quoting=QUOTE_NONNUMERIC, the writer now tests for numeric
    types, rather than any object that can be represented as a number.
  + when quoting=QUOTE_NONNUMERIC, the reader now casts unquoted fields
    to floats.
  + reader now allows \r characters to be quoted (previously it only allowed
    \n to be quoted).
  + writer doublequote handling improved.
  + Dialect classes passed to the module are no longer instantiated by
    the module before being parsed (the former validation scheme required
    this, but the mechanism was unreliable).
  + The dialect registry now contains instances of the internal
    C-coded dialect type, rather than references to Python objects.
  + the internal C-coded dialect type is now immutable.
  + register_dialect now accepts the same keyword dialect specifications
    as the reader and writer, allowing the user to register dialects
    without first creating a dialect class.
  + a configurable limit to the size of parsed fields has been added -
    previously, an unmatched quote character could result in the entire
    file being read into the field buffer before an error was reported.
  + A new module method csv.field_size_limit() has been added that sets
    the parser field size limit (returning the former limit). The initial
    limit is 128kB.
  + A line_num attribute has been added to the reader object, which tracks
    the number of lines read from the source iterator. This is not
    the same as the number of records returned, as records can span
    multiple lines.
  + reader and writer objects were not being registered with the cyclic-GC.
    This has been fixed.
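
Several of the csv changes listed above (keyword-based ``register_dialect``,
``field_size_limit()``, ``line_num`` and the ``QUOTE_NONNUMERIC`` float
conversion) can be sketched together.  This is only an illustration: it assumes
a current Python 3 interpreter, where these APIs still exist, and the dialect
name ``pipes`` and the sample data are invented for the example.

```python
import csv
import io

# Register a dialect from keyword arguments, without writing a Dialect class.
# "pipes" is a made-up name used only in this sketch.
csv.register_dialect("pipes", delimiter="|", quoting=csv.QUOTE_NONNUMERIC)

# Passing a value sets a new field size limit and returns the former one.
old_limit = csv.field_size_limit(64 * 1024)
new_limit = csv.field_size_limit()

# The second record spans two physical lines because of the quoted newline.
source = io.StringIO('"name"|1.5\n"multi\nline"|2\n')
reader = csv.reader(source, dialect="pipes")

rows = []
line_numbers = []
for row in reader:
    rows.append(row)
    # line_num counts source lines consumed so far, not records returned.
    line_numbers.append(reader.line_num)

# Restore global state so the sketch has no lasting side effects.
csv.field_size_limit(old_limit)
csv.unregister_dialect("pipes")
```

With this input, two records are returned from three source lines, and the
unquoted fields arrive as floats rather than strings.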

- _DummyThread objects in the threading module now delete self.__block that is
  inherited from _Thread since it uses up a lock allocated by 'thread'.  The
  lock primitives tend to be limited in number and thus should not be wasted on
  a _DummyThread object.  Fixes bug #1089632.

- The imghdr module now detects Exif files.

- StringIO.truncate() now correctly adjusts the size attribute.
  (Bug #951915).

- locale.py now uses an updated locale alias table (built using
  Tools/i18n/makelocalealias.py, a tool to parse the X11 locale
  alias file); the encoding lookup was enhanced to use Python's
  encoding alias table.

- moved deprecated modules to Lib/lib-old:  whrandom, tzparse, statcache.

- the pickle module no longer accepts the deprecated None option in the
  args tuple returned by __reduce__().

- optparse now optionally imports gettext.  This allows its use in setup.py.

- the pickle module no longer uses the deprecated bin parameter.

- the shelve module no longer uses the deprecated binary parameter.

- the pstats module no longer uses the deprecated ignore() method.

- the filecmp module no longer uses the deprecated use_statcache argument.

- unittest.TestCase.run() and unittest.TestSuite.run() can now be successfully
  extended or overridden by subclasses.  Formerly, the subclassed method would
  be ignored by the rest of the module.  (Bug #1078905).

- heapq.nsmallest() and heapq.nlargest() now support key= arguments with
  the same meaning as in list.sort().

- Bug #1076985: ``codecs.StreamReader.readline()`` now calls ``read()`` only
  once when a size argument is given. This prevents a buffer overflow in the
  tokenizer with very long source lines.

- Bug #1083110: ``zlib.decompress.flush()`` would segfault if called
  immediately after creating the object, without any intervening
  ``.decompress()`` calls.

- The reconvert.quote function can now emit triple-quoted strings.  The
  reconvert module now has some simple documentation.

- ``UserString.MutableString`` now supports negative indices in
  ``__setitem__`` and ``__delitem__``.

- Bug #1149508: ``textwrap`` now handles hyphenated numbers (e.g. "2004-03-05")
  correctly.

- Partial fixes for SF bugs #1163244 and #1175396: If a chunk read by
  ``codecs.StreamReader.readline()`` has a trailing "\r", read one more
  character even if the user has passed a size parameter to get a proper
  line ending. Remove the special handling of a "\r\n" that has been split
  between two lines.

- Bug #1251300: On UCS-4 builds the "unicode-internal" codec will now complain
  about illegal code points. The codec now supports PEP 293 style error
  handlers.

- Bug #1235646: ``codecs.StreamRecoder.next()`` now reencodes the data it reads
  from the input stream, so that the output is a byte string in the correct
  encoding instead of a unicode string.

- Bug #1202493: Fixing SRE parser to handle '{}' as perl does, rather than
  considering it exactly like a '*'.

- Bug #1245379: Add "unicode-1-1-utf-7" as an alias for "utf-7" to
  ``encodings.aliases``.

- ``uu.encode()`` and ``uu.decode()`` now support unicode filenames.

- Patch #1413711: Certain patterns of differences were making difflib
  touch the recursion limit.
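
The ``decimal`` entry earlier in this section (operator methods returning
``NotImplemented`` rather than raising ``TypeError``) is what lets another
class supply a reflected method.  A minimal sketch, assuming a current Python 3
interpreter; the ``Money`` class is hypothetical and exists only for
illustration:

```python
from decimal import Decimal


class Money:
    """Hypothetical wrapper type, not part of the standard library."""

    def __init__(self, amount):
        self.amount = Decimal(amount)

    def __radd__(self, other):
        # Reached only because Decimal.__add__ returned NotImplemented
        # for an operand type it does not know.
        if isinstance(other, Decimal):
            return Money(other + self.amount)
        return NotImplemented


# Calling the method directly shows the sentinel instead of an exception.
direct = Decimal("1.50").__add__(Money("2"))

# The + operator then falls back to Money.__radd__.
total = Decimal("1.50") + Money("2.25")
```

Had ``Decimal.__add__`` raised ``TypeError`` itself, the interpreter would
never have tried ``Money.__radd__``.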

Build
-----

- Patch #1432345: Make python compile on DragonFly.

- Build support for Win64-AMD64 was added.

- Patch #1428494: Prefer linking against ncursesw over ncurses library.

- Patch #881820: look for openpty and forkpty also in libbsd.

- The sources of zlib are now part of the Python distribution (zlib 1.2.3).
  The zlib module is now builtin on Windows.

- Use -xcode=pic32 for CCSHARED on Solaris with SunPro.

- Bug #1189330: configure did not correctly determine the necessary
  value of LINKCC if python was built with GCC 4.0.

- Upgrade Windows build to zlib 1.2.3 which eliminates a potential security
  vulnerability in zlib 1.2.1 and 1.2.2.

- EXTRA_CFLAGS has been introduced as an environment variable to hold compiler
  flags that change binary compatibility.  Changes were also made to
  distutils.sysconfig to also use the environment variable when used during
  compilation of the interpreter and of C extensions through distutils.

- SF patch 1171735: Darwin 8's headers are anal about POSIX compliance,
  and linking has changed (prebinding is now deprecated, and libcc_dynamic
  no longer exists). This configure patch makes things right.

- Bug #1158607: Build with --disable-unicode again.

- spwdmodule.c is built only if either HAVE_GETSPNAM or HAVE_GETSPENT is
  defined.  Discovered as a result of not being able to build on OS X.

- setup.py now uses the directories specified in LDFLAGS using the -L option
  and in CPPFLAGS using the -I option for adding library and include
  directories, respectively, for compiling extension modules against.  This has
  led to the core being compiled using the values in CPPFLAGS.  It also removes
  the need for the special-casing of both DarwinPorts and Fink for darwin since
  the proper directories can be specified in LDFLAGS (``-L/sw/lib`` for Fink,
  ``-L/opt/local/lib`` for DarwinPorts) and CPPFLAGS (``-I/sw/include`` for
  Fink, ``-I/opt/local/include`` for DarwinPorts).

- The test in configure.in that checks for tzset no longer depends on
  tm->tm_zone existing in the struct (it is required by neither ISO C nor the
  UNIX 2 spec).  Tests for sanity in tzname when HAVE_TZNAME is defined were
  also defined.
  Closes bug #1096244.  Thanks Gregory Bond.

C API
-----

- Added a C API for set and frozenset objects.

- Removed PyRange_New().

- Patch #1313939: PyUnicode_DecodeCharmap() accepts a unicode string as the
  mapping argument now. This string is used as a mapping table. Byte values
  greater than the length of the string and 0xFFFE are treated as undefined
  mappings.
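
The string-as-mapping-table behaviour described for
``PyUnicode_DecodeCharmap()`` can be observed from Python.  The sketch below is
an assumption-laden illustration rather than a statement about the C API: it
uses ``codecs.charmap_decode`` on a current CPython 3 build, and the
three-entry table is invented for the example.

```python
import codecs

# Table index = byte value.  Byte 0 -> "A", byte 1 -> "B",
# byte 2 -> U+FFFE, which marks an undefined mapping.
table = "AB\ufffe"

# Bytes that have a defined entry decode through the table.
text, consumed = codecs.charmap_decode(b"\x00\x01\x00", "strict", table)

# Byte 2 maps to U+FFFE and byte 3 lies past the end of the table, so both
# are undefined; the "replace" handler substitutes U+FFFD for each.
replaced, _ = codecs.charmap_decode(b"\x01\x02\x03", "replace", table)
```

With the ``strict`` handler, the same undefined bytes would raise
``UnicodeDecodeError`` instead.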


Tests
-----

- In test_os, st_?time is now truncated before comparing it with ST_?TIME.

- Patch #1276356: New resource "urlfetch" is implemented.  This enables
  even impatient people to run tests that require remote files.


Documentation
-------------

- Bug #1402224: Add warning to dl docs about crashes.

- Bug #1396471: Document that Windows' ftell() can return invalid
  values for text files with UNIX-style line endings.

- Bug #1274828: Document os.path.splitunc().

- Bug #1190204: Clarify which directories are searched by site.py.

- Bug #1193849: Clarify os.path.expanduser() documentation.

- Bug #1243192: re.UNICODE and re.LOCALE affect \d, \D, \s and \S.

- Bug #755617: Document the effects of os.chown() on Windows.

- Patch #1180012: The documentation for modulefinder is now in the library reference.

- Patch #1213031: Document that os.chown() accepts argument values of -1.

- Bug #1190563: Document os.waitpid() return value with WNOHANG flag.

- Bug #1175022: Correct the example code for property().

- Document the IterableUserDict class in the UserDict module.
  Closes bug #1166582.

- Remove all latent references for "Macintosh" that referred to semantics for
  Mac OS 9 and change to reflect the state for OS X.
  Closes patch #1095802.  Thanks Jack Jansen.

Mac
---


New platforms
-------------

- FreeBSD 7 support is added.


Tools/Demos
-----------

- Created Misc/Vim/vim_syntax.py to auto-generate a python.vim file in that
  directory for syntax highlighting in Vim.  The Vim directory was added and
  vimrc was moved into it (it was previously one level up).

- Added two new files to Tools/scripts: pysource.py, which recursively
  finds Python source files, and findnocoding.py, which finds Python
  source files that need an encoding declaration.
  Patch #784089, credits to Oleg Broytmann.

- Bug #1072853: pindent.py used an uninitialized variable.

- Patch #1177597: Correct Complex.__init__.

- Fixed a display glitch in Pynche, which could cause the right arrow to
  wiggle over by a pixel.

What's New in Python 2.4 final?
===============================

*Release date: 30-NOV-2004*

Core and builtins
-----------------

- Bug 875692: Improve signal handling, especially when using threads, by
  forcing an early re-execution of PyEval_EvalFrame() "periodic" code when
  things_to_do is not cleared by Py_MakePendingCalls().


What's New in Python 2.4 (release candidate 1)
==============================================

*Release date: 18-NOV-2004*

Core and builtins
-----------------

- Bug 1061968:  Fixes in 2.4a3 to address thread bug 1010677 reintroduced
  the years-old thread shutdown race bug 225673.  Numeric history lesson
  aside, all bugs in all three reports are fixed now.


Library
-------

- Bug 1052242: If exceptions are raised by an atexit handler function an
  attempt is made to execute the remaining handlers.  The last exception
  raised is re-raised.

- ``doctest``'s new support for adding ``pdb.set_trace()`` calls to
  doctests was broken in a dramatic but shallow way.  Fixed.

- Bug 1065388:  ``calendar``'s ``day_name``, ``day_abbr``, ``month_name``,
  and ``month_abbr`` attributes emulate sequences of locale-correct
  spellings of month and day names.  Because the locale can change at
  any time, the correct spelling is recomputed whenever one of these is
  indexed.  In the worst case, the index may be a slice object, so these
  recomputed every day or month name each time they were indexed.  This is
  much slower than necessary in the usual case, when the index is just an
  integer.  In that case, only the single spelling needed is recomputed
  now; and, when the index is a slice object, only the spellings needed
  by the slice are recomputed now.

- Patch 1061679: Added ``__all__`` to pickletools.py.
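
The two indexing cases from the ``calendar`` entry in this section (a plain
integer versus a slice) look like this in use.  A small sketch, assuming a
current Python 3 interpreter; the spellings returned depend on the active
locale, so none are shown here:

```python
import calendar

# Integer index: only the one spelling that is needed gets computed.
monday = calendar.day_name[0]

# Slice index: only the spellings covered by the slice get computed.
# month_abbr[0] is an empty placeholder, so the months start at index 1.
first_quarter = calendar.month_abbr[1:4]
```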

Build
-----

- Bug 1034277 / Patch 1035255: Remove compilation of core against CoreServices
  and CoreFoundation on OS X.  Involved removing PyMac_GetAppletScriptFile()
  which has no known users.  Thanks Bob Ippolito.

C API
-----

- The PyRange_New() function is deprecated.


What's New in Python 2.4 beta 2?
================================

*Release date: 03-NOV-2004*

License
-------

The Python Software Foundation changed the license under which Python
is released, to remove Python version numbers.  There were no other
changes to the license.  So, for example, wherever the license for
Python 2.3 said "Python 2.3", the new license says "Python".  The
intent is to make it possible to refer to the PSF license in a more
durable way.  For example, some people say they're confused that
the Open Source Initiative's entry for the Python Software Foundation
License::

      http://www.opensource.org/licenses/PythonSoftFoundation.php

says "Python 2.1.1" all over it, wondering whether it applies only
to Python 2.1.1.

The official name of the new license is the Python Software Foundation
License Version 2.

Core and builtins
-----------------

- Bug #1055820: Cyclic garbage collection was not protecting against the
  possibility that calling a live weakref to a piece of cyclic trash could
  resurrect an
  insane mutation of the trash if any Python code ran during gc (via
  running a dead object's __del__ method, running another callback on a
  weakref to a dead object, or via any Python code run in any other thread
  that managed to obtain the GIL while a __del__ or callback was running
  in the thread doing gc).  The most likely symptom was "impossible"
  ``AttributeError`` exceptions, appearing seemingly at random, on weakly
  referenced objects.  The cure was to clear all weakrefs to unreachable
  objects before allowing any callbacks to run.

- Bug #1054139: _PyString_Resize() now invalidates its cached hash value.

Extension Modules
-----------------

- Bug #1048870:  the compiler now generates distinct code objects for
  functions with identical bodies.  This was producing confusing
  traceback messages which pointed to the function where the code
  object was first defined rather than the function being executed.

Library
-------

- Patch #1056967 changes the semantics of Template.safe_substitute() so that
  no ValueError is raised on an 'invalid' match group.  Now the delimiter is
  returned.

- Bug #1052503: pdb.runcall() was not passing along keyword arguments.

- Bug #902037: xml.sax.saxutils.prepare_input_source() now combines relative
  paths with a base path before checking os.path.isfile().

- The whichdb module can now be run from the command line.

- Bug #1045381: time.strptime() can now infer the date using %U or %W (week of
  the year) when the day of the week and year are also specified.

- Bug #1048816: fix bug in Ctrl-K at start of line in curses.textpad.Textbox

- Bug #1017553: fix bug in tarfile.filemode()

- Patch #737473: fix bug that old source code is shown in tracebacks even if
  the source code is updated and reloaded.
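
The ``Template.safe_substitute()`` change noted at the top of this section can
be sketched as follows, assuming a current Python 3 interpreter; the template
text and the ``who`` key are made up for the example:

```python
from string import Template

# "$ " and "$1" are both invalid placeholders: a delimiter that is not
# followed by a valid identifier.
template = Template("$who owes $ 5 and $100")

# safe_substitute() leaves the bare delimiter in place.
safe = template.safe_substitute(who="tim")

# substitute() still rejects the same input.
try:
    template.substitute(who="tim")
    strict_error = None
except ValueError as exc:
    strict_error = exc
```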

Build
-----

- Patch #1044395: --enable-shared is allowed in FreeBSD also.

What's New in Python 2.4 beta 1?
================================

*Release date: 15-OCT-2004*

Core and builtins
-----------------

- Patch #975056: Restartable signals were not correctly disabled on
  BSD systems. Consistently use PyOS_setsig() instead of signal().

- The internal portable implementation of thread-local storage (TLS), used
  by the ``PyGILState_Ensure()``/``PyGILState_Release()`` API, was not
  thread-correct.  This could lead to a variety of problems, up to and
  including segfaults.  See bug 1041645 for an example.

- Added a command line option, -m module, which searches sys.path for the
  module and then runs it.  (Contributed by Nick Coghlan.)

- The bytecode optimizer now folds tuples of constants into a single
  constant.

- SF bug #513866:  Float/long comparison anomaly.  Prior to 2.4b1, when
  an integer was compared to a float, the integer was coerced to a float.
  That could yield spurious overflow errors (if the integer was very
  large), and anomalies such as
  ``long(1e200)+1 == 1e200 == long(1e200)-1``.  Coercion to float is no
  longer performed, and cases like ``long(1e200)-1 < 1e200``,
  ``long(1e200)+1 > 1e200`` and ``(1 << 20000) > 1e200`` are computed
  correctly now.
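
The comparisons named in the float/long entry above can be checked directly.
The sketch assumes a current Python 3 interpreter, where ``long`` has been
merged into ``int``, so ``int(1e200)`` stands in for ``long(1e200)``:

```python
big = int(1e200)  # exact integer value of the float 1e200

checks = {
    # One less than the float's exact value is strictly smaller.
    "below": big - 1 < 1e200,
    # One more is strictly larger.
    "above": big + 1 > 1e200,
    # The exact value itself still compares equal.
    "equal": big == 1e200,
    # An integer far too large to convert to float compares without overflow.
    "huge": (1 << 20000) > 1e200,
    # The old anomaly: big + 1 must not compare equal to the float.
    "no_anomaly": big + 1 != 1e200,
}
```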

Extension modules
-----------------

- ``collections.deque`` objects didn't play quite right with garbage
  collection, which could lead to a segfault in a release build, or
  an assert failure in a debug build.  Also, added overflow checks,
  better detection of mutation during iteration, and shielded deque
  comparisons from unusual subclass overrides of the __iter__() method.

Library
-------

- Patch 1046644: distutils build_ext grew two new options - --swig for
  specifying the swig executable to use, and --swig-opts to specify
  options to pass to swig. --swig-opts="-c++" is the new way to spell
  --swig-cpp.

- Patch 983206: distutils now obeys environment variable LDSHARED, if
  it is set.

- Added Peter Astrand's subprocess.py module.  See PEP 324 for details.

- time.strptime() now properly escapes timezones and all other locale-specific
  strings for regex-specific symbols.  Was breaking under Japanese Windows when
  the timezone was specified as "Tokyo (standard time)".
  Closes bug #1039270.

- Updates for the email package:

  + email.Utils.formatdate() grew a 'usegmt' argument for HTTP support.
  + All deprecated APIs that in email 2.x issued warnings have been removed:
    _encoder argument to the MIMEText constructor, Message.add_payload(),
    Utils.dump_address_pair(), Utils.decode(), Utils.encode()
  + New deprecations: Generator.__call__(), Message.get_type(),
    Message.get_main_type(), Message.get_subtype(), the 'strict' argument to
    the Parser constructor.  These will be removed in email 3.1.
  + Support for Python earlier than 2.3 has been removed (see PEP 291).
  + All defect classes have been renamed to end in 'Defect'.
  + Some FeedParser fixes; also a MultipartInvariantViolationDefect will be
    added to messages that claim to be multipart but really aren't.
  + Updates to documentation.

- re's findall() and finditer() functions now take an optional flags argument
  just like the compile(), search(), and match() functions.  Also, documented
  the previously existing start and stop parameters for the findall() and
  finditer() methods of regular expression objects.

- rfc822 Messages now support iterating over the headers.

- The (undocumented) tarfile.TarFile.membernames has been removed;
  applications should use the getmember function.

- httplib now offers symbolic constants for the HTTP status codes.

- SF bug #1028306:  Trying to compare a ``datetime.date`` to a
  ``datetime.datetime`` mistakenly compared only the year, month and day.
  Now it acts like a mixed-type comparison:  ``False`` for ``==``,
  ``True`` for ``!=``, and raises ``TypeError`` for other comparison
  operators.  Because datetime is a subclass of date, comparing only the
  base class (date) members can still be done, if that's desired, by
  forcing use of the appropriate date method; e.g.,
  ``a_date.__eq__(a_datetime)`` is true if and only if the year, month
  and day members of ``a_date`` and ``a_datetime`` are equal.

- bdist_rpm now supports command line options --force-arch,
  {pre,post}-install,  {pre,post}-uninstall, and
  {prep,build,install,clean,verify}-script.

- SF patch #998993: The UTF-8 and the UTF-16 stateful decoders now support
  decoding incomplete input (when the input stream is temporarily exhausted).
  ``codecs.StreamReader`` now implements buffering, which enables proper
  readline support for the UTF-16 decoders. ``codecs.StreamReader.read()``
  has a new argument ``chars`` which specifies the number of characters to
  return. ``codecs.StreamReader.readline()`` and
  ``codecs.StreamReader.readlines()`` have a new argument ``keepends``.
  Trailing "\n"s will be stripped from the lines if ``keepends`` is false.

- The documentation for doctest is greatly expanded, and now covers all
  the new public features (of which there are many).

- ``doctest.master`` was put back in, and ``doctest.testmod()`` once again
  updates it.  This isn't good, because every ``testmod()`` call
  contributes to bloating the "hidden" state of ``doctest.master``, but
  some old code apparently relies on it.  For now, all we can do is
  encourage people to stitch doctests together via doctest's unittest
  integration features instead.

- httplib now handles ipv6 address/port pairs.

- SF bug #1017864: ConfigParser now correctly handles default keys,
  processing them with ``ConfigParser.optionxform`` when supplied,
  consistent with the handling of config file entries and runtime-set
  options.

- SF bug #997050: Document, test, & check for non-string values in
  ConfigParser.  Moved the new string-only restriction added in
  rev. 1.65 to the SafeConfigParser class, leaving existing
  ConfigParser & RawConfigParser behavior alone, and documented the
  conditions under which non-string values work.
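
The mixed ``datetime.date`` / ``datetime.datetime`` comparison rules from the
SF bug #1028306 entry in this section can be sketched like this.  The example
assumes a current Python 3 interpreter, and the specific dates are arbitrary:

```python
from datetime import date, datetime

a_date = date(2004, 10, 15)
a_datetime = datetime(2004, 10, 15, 12, 30)

# Mixed-type equality: never equal, always unequal.
equal = a_date == a_datetime
not_equal = a_date != a_datetime

# Ordering comparisons between the two types raise TypeError.
try:
    a_date < a_datetime
    ordering_error = None
except TypeError as exc:
    ordering_error = exc

# To compare calendar days only, convert explicitly first.
same_day = a_date == a_datetime.date()
```

Converting with ``datetime.date()`` is used here instead of the
``a_date.__eq__(a_datetime)`` spelling from the entry, because it makes the
intent visible at the call site.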

Build
-----

- Building on darwin now includes /opt/local/include and /opt/local/lib for
  building extension modules.  This is so as to include software installed as
  a DarwinPorts port <http://darwinports.opendarwin.org/>.

- pyport.h now defines a Py_IS_NAN macro.  It works as-is when the
  platform C computes true for ``x != x`` if and only if x is a NaN.
  Other platforms can override the default definition with a platform-
  specific spelling in that platform's pyconfig.h.  You can also override
  pyport.h's default Py_IS_INFINITY definition now.

C API
-----

- SF patch 1044089:  New function ``PyEval_ThreadsInitialized()`` returns
  non-zero if PyEval_InitThreads() has been called.

- The undocumented and unused extern int ``_PyThread_Started`` was removed.

- The C API calls ``PyInterpreterState_New()`` and ``PyThreadState_New()``
  are two of the very few advertised as being safe to call without holding
  the GIL.  However, this wasn't true in a debug build, as bug 1041645
  demonstrated.  In a debug build, Python redirects the ``PyMem`` family
  of calls to Python's small-object allocator, to get the benefit of
  its extra debugging capabilities.  But Python's small-object allocator
  isn't threadsafe, relying on the GIL to avoid the expense of doing its
  own locking.  ``PyInterpreterState_New()`` and ``PyThreadState_New()``
  call the platform ``malloc()`` directly now, regardless of build type.

- PyLong_AsUnsignedLong[Mask] now support int objects as well.

- SF patch #998993: ``PyUnicode_DecodeUTF8Stateful`` and
  ``PyUnicode_DecodeUTF16Stateful`` have been added, which implement stateful
  decoding.

Tests
-----

- test__locale ported to unittest

Mac
---

- ``plistlib`` now supports non-dict root objects.  There is also a new
  interface for reading and writing plist files: ``readPlist(pathOrFile)``
  and ``writePlist(rootObject, pathOrFile)``

Tools/Demos
-----------

- The text file comparison scripts ``ndiff.py`` and ``diff.py`` now
  read the input files in universal-newline mode.  This spares them
  from consuming a great deal of time to deduce the useless result that,
  e.g., a file with Windows line ends and a file with Linux line ends
  have no lines in common.
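
The problem that universal-newline mode avoids can be reproduced with
``difflib`` on in-memory strings.  This is only a sketch of the effect, not the
scripts' actual code; it assumes a current Python 3 interpreter and uses
``str.splitlines()`` as a stand-in for reading files with normalised line ends:

```python
import difflib

windows_text = "alpha\r\nbeta\r\n"
unix_text = "alpha\nbeta\n"

# Line endings kept: every line differs, so ndiff reports nothing in common.
raw = list(difflib.ndiff(windows_text.splitlines(keepends=True),
                         unix_text.splitlines(keepends=True)))
raw_common = [line for line in raw if line.startswith("  ")]

# Line endings normalised away: the two inputs are identical.
normalized = list(difflib.ndiff(windows_text.splitlines(),
                                unix_text.splitlines()))
normalized_changes = [line for line in normalized
                      if not line.startswith("  ")]
```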


What's New in Python 2.4 alpha 3?
=================================

*Release date: 02-SEP-2004*

Core and builtins
-----------------

- SF patch #1007189: ``from ... import ...`` statements now allow the name
  list to be surrounded by parentheses.

- Some speedups for long arithmetic, thanks to Trevor Perrin.  Gradeschool
  multiplication was sped a little by optimizing the C code.  Gradeschool
  squaring was sped by about a factor of 2, by exploiting that about half
  the digit products are duplicates in a square.  Because exponentiation
  uses squaring often, this also speeds long power.  For example, the time
  to compute 17**1000000 dropped from about 14 seconds to 9 on my box due
  to this much.  The cutoff for Karatsuba multiplication was raised,
  since gradeschool multiplication got quicker, and the cutoff was
  aggressively small regardless.  The exponentiation algorithm was switched
  from right-to-left to left-to-right, which is more efficient for small
  bases.  In addition, if the exponent is large, the algorithm now does
  5 bits (instead of 1 bit) at a time.  That cut the time to compute
  17**1000000 on my box in half again, down to about 4.5 seconds.

- OverflowWarning is no longer generated.  PEP 237 scheduled this to
  occur in Python 2.3, but since OverflowWarning was disabled by default,
  nobody realized it was still being generated.  On the chance that user
  code is still using them, the Python builtin OverflowWarning, and
  corresponding C API PyExc_OverflowWarning, will exist until Python 2.5.

1363 1364
- Py_InitializeEx has been added.

- Fix the order of application of decorators.  The proper order is bottom-up;
  the first decorator listed is the last one called.

- SF patch #1005778.  Fix a seg fault if the list size changed while
  calling list.index().  This could happen if a rich comparison function
  modified the list.

- The ``func_name`` (a.k.a. ``__name__``) attribute of user-defined
  functions is now writable.
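
One likely use is letting a wrapper take over the name of the function it
wraps.  A minimal sketch, with a hypothetical ``logged`` decorator and the
``__name__`` spelling of the attribute:

```python
def logged(func):
    def wrapper(*args, **kwargs):
        return func(*args, **kwargs)
    # Copy the public name across so tracebacks and repr() stay readable.
    wrapper.__name__ = func.__name__
    return wrapper

@logged
def compute():
    return 42
```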

- code_new (a.k.a. new.code()) now checks its arguments sufficiently
  carefully that passing them on to PyCode_New() won't trigger calls
  to Py_FatalError() or PyErr_BadInternalCall().  It is still the case
  that the returned code object might be entirely insane.

- Subclasses of string can no longer be interned.  The semantics of
  interning were not clear here -- a subclass could be mutable, for
  example -- and had bugs.  Explicitly interning a subclass of string
  via intern() will raise a TypeError.  Internal operations that attempt
  to intern a string subclass will have no effect.

- Bug 1003935:  xrange() could report bogus OverflowErrors.  Documented
  what xrange() intends, and repaired tests accordingly.

Extension modules
-----------------

- difflib now supports HTML side-by-side diff.

- os.urandom has been added for systems that support sources of random
  data.
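
A minimal usage sketch; the 16-byte length is an arbitrary choice:

```python
import binascii
import os

token = os.urandom(16)               # 16 bytes from the OS randomness source
hex_token = binascii.hexlify(token)  # printable form, two hex digits per byte
```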

- Patch 1012740:  truncate() on a writeable cStringIO now resets the
  position to the end of the stream.  This is consistent with the original
  StringIO module and avoids inadvertently resurrecting data that was
  supposed to have been truncated away.

- Added socket.socketpair().
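
A short sketch of a connected pair exchanging data inside one process,
written in current Python syntax.  Availability depends on the platform:

```python
import socket

parent, child = socket.socketpair()   # two already-connected sockets
try:
    parent.sendall(b"ping")
    request = child.recv(4)
    child.sendall(b"pong")
    reply = parent.recv(4)
finally:
    parent.close()
    child.close()
```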

- Added CurrentByteIndex, CurrentColumnNumber, CurrentLineNumber
  members to xml.parsers.expat.XMLParser object.

- The mpz, rotor, and xreadlines modules, all deprecated in earlier
  versions of Python, have now been removed.

Library
-------

- Patch #934356: if a module defines __all__, believe that rather than using
  heuristics for filtering out imported names.

- Patch #941486: added os.path.lexists(), which returns True for broken
  symlinks, unlike os.path.exists().
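
The difference can be seen with a dangling link.  This sketch assumes a
platform and account that may create symlinks; the file names are made up:

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp:
    link = os.path.join(tmp, "dangling")
    # Point the link at a path that does not exist.
    os.symlink(os.path.join(tmp, "missing-target"), link)
    target_exists = os.path.exists(link)   # follows the link
    link_exists = os.path.lexists(link)    # looks at the link itself
```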

- The random module now uses os.urandom() for seeding if it is available.
  Added a new generator based on os.urandom().
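
In released versions the os.urandom()-based generator is exposed as
``random.SystemRandom``.  A minimal sketch of using it:

```python
import random

rng = random.SystemRandom()          # draws from os.urandom(); cannot be seeded
roll = rng.randint(1, 6)
pick = rng.choice(["red", "green", "blue"])
```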

- difflib and diff.py can now generate HTML.

- bdist_rpm now includes version and release in the BuildRoot, and
  replaces - by ``_`` in version and release.

- distutils build/build_scripts now has an -e option to specify the
  path to the Python interpreter for installed scripts.

- PEP 292 classes Template and SafeTemplate are added to the string module.
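
A sketch of PEP 292 substitution.  It uses the spelling from later
releases, where the SafeTemplate behaviour is reached through
``Template.safe_substitute()``; the placeholder names are illustrative:

```python
from string import Template

note = Template("$who owes ${amount} dollars")
filled = note.substitute(who="Tim", amount=5)
# safe_substitute() leaves unknown placeholders in place instead of raising.
partial = note.safe_substitute(who="Tim")
```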

- tarfile now generates GNU tar files by default.

- HTTPResponse now has a getheaders() method.

- Patch #1006219: let inspect.getsource handle '@' decorators. Thanks Simon
  Percivall.

- logging.handlers.SMTPHandler.date_time has been removed;
  the class now uses email.Utils.formatdate to generate the time stamp.

- A new function tkFont.nametofont was added to return an existing
  font. The Font class constructor now has an additional exists argument
  which, if True, requests to return/configure an existing font, rather
  than creating a new one.

- Updated the decimal package's min() and max() methods to match the
  latest revision of the General Decimal Arithmetic Specification.
  Quiet NaNs are ignored and equal values are sorted based on sign
  and exponent.

- The decimal package's Context.copy() method now returns deep copies.

- Deprecated sys.exitfunc in favor of the atexit module.  The sys.exitfunc
  attribute will be kept around for backwards compatibility and atexit
  will just become the one preferred way to do it.

- patch #675551: Add get_history_item and replace_history_item functions
  to the readline module.

- bug #989672: pdb.doc and the help messages for the help_d and help_u methods
  of the pdb.Pdb class have been corrected. d(own) goes to a newer
  frame, u(p) to an older frame, not the other way around.

- bug #990669: os.path.realpath() will resolve symlinks before normalizing the
  path, as normalizing the path may alter the meaning of the path if it
  contains symlinks.

- bug #851123: shutil.copyfile will raise an exception when trying to copy a
  file onto a link to itself. Thanks Gregory Ball.

- bug #570300: Fix inspect to resolve file locations using os.path.realpath()
  so as to properly list all functions in a module when the module itself is
  reached through a symlink.  Thanks Johannes Gijsbers.

- doctest refactoring continued.  See the docs for details.  As part of
  this effort, some old and little- (never?) used features are now
  deprecated:  the Tester class, the module is_private() function, and the
  isprivate argument to testmod().  The Tester class supplied a feeble
  "by hand" way to combine multiple doctests, if you knew exactly what
  you were doing.  The newer doctest features for unittest integration
  already did a better job of that, are stronger now than ever, and the
  new DocTestRunner class is a saner foundation if you want to do it by
  hand.  The "private name" filtering gimmick was a mistake from the
  start, and testmod() changed long ago to ignore it by default.  If
  you want to filter out tests, the new DocTestFinder class can be used
  to return a list of all doctests, and you can filter that list by
  any computable criteria before passing it to a DocTestRunner instance.
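
The find, filter, run sequence described above might look like the
following.  This is a sketch against the doctest API as released; the two
sample functions and the ``namespace`` dict are invented for the example:

```python
import doctest

def double(x):
    """
    >>> double(2)
    4
    """
    return 2 * x

def triple(x):
    """
    >>> triple(2)
    6
    """
    return 3 * x

# Hypothetical namespace handed to every example.
namespace = {"double": double, "triple": triple}

finder = doctest.DocTestFinder()
tests = []
for func in (double, triple):
    # module=False skips module lookup; globs supplies the names the examples use.
    tests.extend(finder.find(func, module=False, globs=namespace))

# Filter by any computable criterion, here simply the test name.
selected = [t for t in tests if t.name == "double"]

runner = doctest.DocTestRunner(verbose=False)
for t in selected:
    runner.run(t)
results = runner.summarize(verbose=False)
```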

- Bug #891637, patch #1005466: fix inspect.getargs() crash on def foo((bar)).

Tools/Demos
-----------

- IDLE's shortcut keys for windows are now case insensitive so that
  Control-V works the same as Control-v.

- pygettext.py: Generate POT-Creation-Date header in ISO format.

Build
-----

- Backward incompatibility:  longintrepr.h now triggers a compile-time
  error if SHIFT (the number of bits in a Python long "digit") isn't
  divisible by 5.  This new requirement allows simple code for the new
  5-bits-at-a-time long_pow() implementation.  If necessary, the
  restriction could be removed (by complicating long_pow(), or by
  falling back to the 1-bit-at-a-time algorithm), but there are no
  plans to do so.

- bug #991962: When building with --disable-toolbox-glue on Darwin, no
  attempt is made to build Mac-specific modules.

- The --with-tsc flag to configure to enable VM profiling with the
  processor's timestamp counter now works on PPC platforms.

- patch #1006629: Define _XOPEN_SOURCE to 500 on Solaris 8/9 to match
  GCC's definition and avoid redefinition warnings.

- Detect pthreads support (provided by gnu pth pthread emulation) on
  GNU/k*BSD systems.

- bug #1005737, #1007249: Fixed several build problems and warnings
  found on old/legacy C compilers of HP-UX, IRIX and Tru64.

C API
-----

..

Documentation
-------------

- patch #1005936, bug #1009373: fix index entries which contain
  an underscore when viewed with Acrobat.

- bug #990669: os.path.normpath may alter the meaning of a path if
  it contains symbolic links. This has been documented in a comment
  since 1992, but is now in the library reference as well.

New platforms
-------------

- FreeBSD 6 is now supported.

Tests
-----

..

Windows
-------

- Boosted the stack reservation for python.exe and pythonw.exe from
  the default 1MB to 2MB.  Stack frames under VC 7.1 for 2.4 are enough
  bigger than under VC 6.0 for 2.3.4 that deeply recursive programs
  within the default sys.getrecursionlimit() value of 1000 were
  able to suffer undetected C stack overflows.  The standard test program
  test_compiler was one such program.  If a Python process on Windows
  "just vanishes" without a trace, and without an error message of any
  kind, but with an exit code of 128, undetected stack overflow may be
  the problem.

Mac
---

..


What's New in Python 2.4 alpha 2?
=================================

*Release date: 05-AUG-2004*

Core and builtins
-----------------

- Patch #980695:  Implements efficient string concatenation for statements
  of the form s=s+t and s+=t.  Whether this optimization applies will vary
  across implementations.  Accordingly, the str.join() method is still
  strongly preferred for performance-sensitive code.
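
A short sketch contrasting the two styles.  Both produce the same words,
but only the join() form avoids depending on an implementation-specific
optimization:

```python
words = ["spam", "eggs", "ham"]

# Repeated concatenation: fast only where the interpreter optimizes it.
slow = ""
for word in words:
    slow += word + " "

# Portable idiom: build the pieces, then join once.
fast = " ".join(words)
```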

- PEP 318: function decorators have been added to the language.  These are
  implemented using the Java-style @decorator syntax, like so::

     @staticmethod
     def foo(bar):
         pass

  (The PEP needs to be updated to reflect the current state)

- When importing a module M raises an exception, Python no longer leaves M
  in sys.modules.  Before 2.4a2 it did, and a subsequent import of M would
  succeed, picking up a module object from sys.modules reflecting as much
  of the initialization of M as completed before the exception was raised.
  Subsequent imports got no indication that M was in a partially-
  initialized state, and the importers could get into arbitrarily bad
  trouble as a result (the M they got was in an unintended state,
  arbitrarily far removed from M's author's intent).  Now subsequent
  imports of M will continue raising exceptions (but if, for example, the
  source code for M is edited between import attempts, then perhaps later
  attempts will succeed, or raise a different exception).

  This can break existing code, but in such cases the code was probably
  working before by accident.  In the Python source, the only case of
  breakage discovered was in a test accidentally relying on a damaged
  module remaining in sys.modules.  Cases are also known where tests
  deliberately provoking import errors remove damaged modules from
  sys.modules themselves, and such tests will break now if they do an
  unconditional del sys.modules[M].
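
The new behaviour, and a cleanup idiom that tolerates it, can be sketched
as follows.  The module name is hypothetical and the module is written to a
temporary directory just for the demonstration:

```python
import os
import sys
import tempfile

MODULE = "broken_module_example"  # hypothetical module name

with tempfile.TemporaryDirectory() as tmp:
    with open(os.path.join(tmp, MODULE + ".py"), "w") as handle:
        handle.write("partial = 1\nraise RuntimeError('init failed')\n")
    sys.path.insert(0, tmp)
    try:
        try:
            __import__(MODULE)
            import_failed = False
        except RuntimeError:
            import_failed = True
        # The half-initialized module is no longer left behind.
        left_in_sys_modules = MODULE in sys.modules
        # Cleanup that works whether or not the entry is still present.
        sys.modules.pop(MODULE, None)
    finally:
        sys.path.remove(tmp)
```

Tests that used an unconditional ``del sys.modules[M]`` can switch to the
``pop()`` form shown here.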

- u'%s' % obj will now try obj.__unicode__() first and fall back to
  obj.__str__() if no __unicode__ method can be found.

- Patch #550732: Add PyArg_VaParseTupleAndKeywords().  Analogous to
  PyArg_VaParse().  Both are now documented.  Thanks Greg Chapman.

- Allow string and unicode return types from .encode()/.decode()
  methods on string and unicode objects.  Added unicode.decode()
  which was missing for no apparent reason.

- Attempted to fix the mess that is Python's behaviour with
  signal handlers and threads, complicated by readline's behaviour.
  It's quite possible that there are still bugs here.

- Added C macros Py_CLEAR and Py_VISIT to ease the implementation of
  types that support garbage collection.

- Compiler now treats None as a constant.

- The types of values returned by __int__, __float__, __long__,
  __oct__, and __hex__ are now checked.  Returning an invalid type
  will cause a TypeError to be raised.  This matches the behavior of
  Jython.
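
A minimal sketch of the check, written in current Python syntax.  The
``BadInt`` class is invented to trigger it:

```python
class BadInt:
    def __int__(self):
        return "not a number"   # wrong type on purpose

try:
    int(BadInt())
except TypeError as exc:
    message = str(exc)
else:
    message = None
```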

- Implemented bind_textdomain_codeset() in locale module.

- Added a workaround for proper string operations in BSDs.  str.split
  and str.is* methods can now work correctly with UTF-8 locales.

- Bug #989185: unicode.iswide() and unicode.width() have been dropped and
  the East Asian Width support has been moved to the unicodedata extension
  module.

- Patch #941229: The source code encoding in interactive mode
  now refers to sys.stdin.encoding rather than always assuming ISO-8859-1.
  This allows non-Latin-1 users to write unicode strings directly.

Extension modules
-----------------

- cPickle now supports the same keyword arguments as pickle.

Library
-------

- Added new codecs and aliases for ISO_8859-11, ISO_8859-16 and
  TIS-620.

- Thanks to Edward Loper, doctest has been massively refactored, and
  many new features were added.  Full docs will appear later.  For now
  the doctest module comments and new test cases give good coverage.
  The refactoring provides many hook points for customizing behavior
  (such as how to report errors, and how to compare expected to actual
  output).  New features include a <BLANKLINE> marker for expected
  output containing blank lines, options to produce unified or context
  diffs when actual output doesn't match expectations, an option to
  normalize whitespace before comparing, and an option to use an
  ellipsis to signify "don't care" regions of output.
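
Three of the new features can be exercised together.  The sketch below
uses the option spellings from the released doctest module
(``<BLANKLINE>``, ``+ELLIPSIS``, ``+NORMALIZE_WHITESPACE``); the sample
text is made up:

```python
import doctest

SAMPLE = r'''
>>> print("first\n\nthird")
first
<BLANKLINE>
third
>>> print(list(range(20)))  # doctest: +ELLIPSIS
[0, 1, ..., 19]
>>> print("a    b")  # doctest: +NORMALIZE_WHITESPACE
a b
'''

# Parse the examples out of the string and run them with an empty namespace.
test = doctest.DocTestParser().get_doctest(SAMPLE, {}, "sample", None, 0)
runner = doctest.DocTestRunner(verbose=False)
results = runner.run(test)
```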

- Tkinter now supports the wish -sync and -use options.

- The following methods in time support passing of None: ctime(), gmtime(),
  and localtime().  If None is provided, the current time is used (the
  same as when the argument is omitted).
  [SF bug 658254, patch 663482]

- nntplib now allows a .netrc file to be ignored.

- urllib2 now recognizes Basic authentication even if other authentication
  schemes are offered.

- Bug #1001053.  wave.open() now accepts unicode filenames.

- gzip.GzipFile has a new fileno() method, to retrieve the handle of the
  underlying file object (provided it has a fileno() method).  This is
  needed if you want to use os.fsync() on a GzipFile.
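
A sketch of the os.fsync() use case mentioned above, using a throwaway
file in a temporary directory:

```python
import gzip
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "log.gz")
    gz = gzip.GzipFile(path, "wb")
    try:
        gz.write(b"important record\n")
        gz.flush()                 # push compressed bytes to the underlying file
        os.fsync(gz.fileno())      # then ask the OS to commit them to disk
    finally:
        gz.close()
    with gzip.GzipFile(path, "rb") as check:
        recovered = check.read()
```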

- imaplib has two new methods: deleteacl and myrights.

- nntplib has two new methods: description and descriptions. They
  use a more RFC-compliant way of getting a newsgroup description.

- Bug #993394.  Fix a possible red herring of KeyError in 'threading' being
  raised during interpreter shutdown from a registered function with atexit
  when dummy_threading is being used.

- Bug #857297/Patch #916874.  Fix an error when extracting a hard link
  from a tarfile.

- Patch #846659.  Fix an error in tarfile.py when using
  GNU longname/longlink creation.

- The obsolete FCNTL.py has been deleted.  The builtin fcntl module
  has been available (on platforms that support fcntl) since Python
  1.5a3, and all FCNTL.py did is export fcntl's names, after generating
  a deprecation warning telling you to use fcntl directly.

- Several new unicode codecs are added: big5hkscs, euc_jis_2004,
  iso2022_jp_2004, shift_jis_2004.

- Bug #788520.  Queue.{get, get_nowait, put, put_nowait} have new
  implementations, exploiting Conditions (which didn't exist at the time
  Queue was introduced).  A minor semantic change is that the Full and
  Empty exceptions raised by non-blocking calls now occur only if the
  queue truly was full or empty at the instant the queue was checked (of
  course the Queue may no longer be full or empty by the time a calling
  thread sees those exceptions, though).  Before, the exceptions could
  also be raised if it was "merely inconvenient" for the implementation
  to determine the true state of the Queue (because the Queue was locked
  by some other method in progress).
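
A single-threaded sketch of the non-blocking calls.  It is written for a
current interpreter, where the module is spelled ``queue`` rather than
``Queue``:

```python
import queue

q = queue.Queue(maxsize=1)
q.put_nowait("job")

try:
    q.put_nowait("overflow")     # queue really is full at this instant
except queue.Full:
    was_full = True
else:
    was_full = False

item = q.get_nowait()

try:
    q.get_nowait()               # queue really is empty at this instant
except queue.Empty:
    was_empty = True
else:
    was_empty = False
```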

- Bugs #979794 and #980117: difflib.get_grouped_opcodes() now handles the
  case of comparing two empty lists.  This affected both context_diff() and
  unified_diff().

- Bug #980938: smtplib now prints debug output to sys.stderr.

- Bug #930024: posixpath.realpath() now handles infinite loops in symlinks by
  returning the last point in the path that was not part of any loop.  Thanks
  AM Kuchling.

- Bug #980327: ntpath now handles compressing erroneous slashes between the
  drive letter and the rest of the path.  Also clearly handles UNC addresses
  now.  Thanks Paul Moore.

- bug #679953: zipfile.py should now work for files over 2 GB.  The packed data
  for file sizes (compressed and uncompressed) was being stored as signed
  instead of unsigned.
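
The signed versus unsigned distinction can be illustrated with the struct
module.  This is not the zipfile source; the format codes and the 3 GB size
are assumptions chosen to show why a signed 32-bit field fails above 2 GB:

```python
import struct

size = 3 * 1024 ** 3                      # 3 GB, larger than 2**31 - 1

unsigned = struct.pack("<L", size)        # unsigned 32-bit: fits
try:
    struct.pack("<l", size)               # signed 32-bit: out of range
except struct.error:
    signed_ok = False
else:
    signed_ok = True

roundtrip = struct.unpack("<L", unsigned)[0]
```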

- decimal.py now only uses signals in the IBM spec.  The other conditions are
  no longer part of the public API.

- codecs module now has two new generic APIs: encode() and decode()
  which don't restrict the return types (unlike the unicode and
  string methods of the same name).
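
A brief sketch of the generic functions with codecs whose output type
matches their input type.  The codec names are the ones shipped with
current Python:

```python
import codecs

scrambled = codecs.encode("Python News", "rot_13")   # text in, text out
restored = codecs.decode(scrambled, "rot_13")
raw = codecs.decode(b"4e455753", "hex")              # bytes in, bytes out
```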

- Non-blocking SSL sockets work again; they were broken in Python 2.3.
  SF patch 945642.

- doctest unittest integration improvements:

  o Improved the unittest test output for doctest-based unit tests

  o Can now pass setUp and tearDown functions when creating
    DocTestSuites.

- The threading module has a new class, local, for creating objects
  that provide thread-local data.
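
A minimal sketch of thread-local data.  The ``context`` object and the
``seen`` dict are names made up for the example:

```python
import threading

context = threading.local()
context.name = "main"
seen = {}

def worker(label):
    # Each thread starts without the attributes set by other threads.
    seen[label + "_had_name"] = hasattr(context, "name")
    context.name = label
    seen[label] = context.name

thread = threading.Thread(target=worker, args=("worker",))
thread.start()
thread.join()
```

After the worker finishes, the main thread still sees its own value of
``context.name``.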

- Bug #990307: when keep_empty_values is True, cgi.parse_qsl()
  no longer returns spurious empty fields.

- Implemented bind_textdomain_codeset() in gettext module.

- Introduced in gettext module the l*gettext() family of functions,
  which return translation strings encoded in the preferred encoding,
  as informed by locale module's getpreferredencoding().

- optparse module (and tests) upgraded to Optik 1.5a1.  Changes:

  - Add expansion of default values in help text: the string
    "%default" in an option's help string is expanded to str() of
    that option's default value, or "none" if no default value.

  - Bug #955889: option default values that happen to be strings are
    now processed in the same way as values from the command line; this
    allows generation of nicer help when using custom types.  Can
    be disabled with parser.set_process_default_values(False).

  - Bug #960515: don't crash when generating help for callback
    options that specify 'type', but not 'dest' or 'metavar'.

  - Feature #815264: change the default help format for short options
    that take an argument from e.g. "-oARG" to "-o ARG"; add
    set_short_opt_delimiter() and set_long_opt_delimiter() methods to
    HelpFormatter to allow (slight) customization of the formatting.

  - Patch #736940: internationalize Optik: all built-in user-
    targeted literal strings are passed through gettext.gettext().  (If
    you want translations (.po files), they're not included with Python
    -- you'll find them in the Optik source distribution from
    http://optik.sourceforge.net/ .)

  - Bug #878453: respect $COLUMNS environment variable for
    wrapping help output.

  - Feature #988122: expand "%prog" in the 'description' passed
    to OptionParser, just like in the 'usage' and 'version' strings.
    (This is *not* done in the 'description' passed to OptionGroup.)
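
Two of the optparse changes listed above, "%default" in help strings and
"%prog" in the description, can be sketched together.  The program name and
option are invented for the example:

```python
from optparse import OptionParser

parser = OptionParser(prog="frob",
                      description="%prog rewrites its input files.")
parser.add_option("-n", "--count", type="int", default=10,
                  help="number of passes [default: %default]")

# Collapse whitespace so the check does not depend on line wrapping.
help_text = " ".join(parser.format_help().split())
```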

C API
-----

- PyImport_ExecCodeModule() and PyImport_ExecCodeModuleEx():  if an
  error occurs while loading the module, these now delete the module's
  entry from sys.modules.  All ways of loading modules eventually call
  one of these, so this is an error-case change in semantics for all
  ways of loading modules.  In rare cases, a module loader may wish
  to keep a module object in sys.modules even though the module's
  code cannot be executed.  In such cases, the module loader must
  arrange to reinsert the name and module object in sys.modules.
  PyImport_ReloadModule() has been changed to reinsert the original
  module object into sys.modules if the module reload fails, so that
  its visible semantics have not changed.

- A large pile of datetime field-extraction macros is now documented,
  thanks to Anthony Tuininga (patch #986010).

Documentation
-------------

- Improved the tutorial on creating types in C.

  - point out the importance of reassigning data members before
    assigning their values

  - correct my misconception about return values from visitprocs. Sigh.

  - mention the labor saving Py_VISIT and Py_CLEAR macros.

- Major rewrite of the math module docs, to address common confusions.

Tests
-----

- The test data files for the decimal test suite are now installed on
  platforms that use the Makefile.

- SF patch 995225:  The test file testtar.tar accidentally contained
  CVS keywords (like $Id$), which could cause spurious failures in
  test_tarfile.py depending on how the test file was checked out.


What's New in Python 2.4 alpha 1?
=================================

*Release date: 08-JUL-2004*

Core and builtins
-----------------

- weakref.ref is now the type object also known as
  weakref.ReferenceType; it can be subclassed like any other new-style
  class.  There's less per-entry overhead in WeakValueDictionary
  objects now (one object instead of three).
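
A sketch of subclassing weakref.ref to carry extra data, written in
current Python syntax.  The ``LabeledRef`` and ``Node`` classes are
hypothetical:

```python
import weakref

class LabeledRef(weakref.ref):
    """A weak reference that also stores arbitrary keyword annotations."""
    def __init__(self, ob, callback=None, **annotations):
        super().__init__(ob, callback)
        for key, value in annotations.items():
            setattr(self, key, value)

class Node:
    pass

node = Node()
ref = LabeledRef(node, label="root")
```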

- Bug #951851: Python crashed when reading import table of certain
  Windows DLLs.

- Bug #215126.  The locals argument to eval(), execfile(), and exec now
  accepts any mapping type.
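
A sketch with eval() and a locals object that is not a dict.  The
``Squares`` class is invented; it computes values on lookup instead of
storing them:

```python
class Squares:
    """Minimal read-only mapping: 'sq7' looks up 7 squared."""
    def __getitem__(self, name):
        if name.startswith("sq") and name[2:].isdigit():
            return int(name[2:]) ** 2
        raise KeyError(name)

# globals must still be a real dict; locals may be any mapping.
value = eval("sq3 + sq4", {}, Squares())
```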

- marshal now shares interned strings. This change introduces
  a new .pyc magic.

- Bug #966623.  Classes created with type() in an exec(, {}) don't
  have a __module__, but code in typeobject assumed it would always
  be there.

- Python no longer relies on the LC_NUMERIC locale setting to be
  the "C" locale; as a result, it no longer tries to prevent changing
  the LC_NUMERIC category.

- Bug #952807:  Unpickling pickled instances of subclasses of
  datetime.date, datetime.datetime and datetime.time could yield insane
  objects.  Thanks to Jiwon Seo for a fix.

- Bug #845802: Python crashes when __init__.py is a directory.

- Unicode objects received two new methods: iswide() and width().
  These query East Asian width information, as specified in Unicode
  TR11.

- Improved the tuple hashing algorithm to give fewer collisions in
  common cases.  Fixes bug #942952.

- Implemented generator expressions (PEP 289).  Coded by Jiwon Seo.
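
Two small illustrations of the syntax; the data is arbitrary:

```python
# No intermediate list is built; values are produced one at a time.
total = sum(n * n for n in range(10))
longest = max(len(word) for word in "generator expressions are lazy".split())
```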

- Enabled the profiling of C extension functions (and builtins).  Check the
  new documentation and the modified profile and bdb modules for more details.

- Set file.name to the object passed to open (instead of a new string).

- Moved tracebackobject into traceback.h and renamed to PyTracebackObject.

- Optimized the byte coding for multiple assignments like "a,b=b,a" and
  "a,b,c=1,2,3".  Improves their speed by 25% to 30%.

- Limit the nested depth of a tuple for the second argument to isinstance()
  and issubclass() to the recursion limit of the interpreter.
  Fixes bug #858016.

- Optimized dict iterators, creating separate types for each
  and having them reveal their length.  Also optimized the
  methods:  keys(), values(), and items().

- Implemented a new opcode, LIST_APPEND, that simplifies
  the generated bytecode for list comprehensions and further
  improves their performance (about 35%).

- Implemented rich comparisons for floats, which seems to make
  comparisons involving NaNs somewhat less surprising when the
  underlying C compiler actually implements C99 semantics.

- Optimized list.extend() to save memory and no longer create
  intermediate sequences.  Also, extend() now pre-allocates the
  needed memory whenever the length of the iterable is known in
  advance -- this halves the time to extend the list.

- Optimized list resize operations to make fewer calls to the system
  realloc().  Significantly speeds up list appends, list pops,
  list comprehensions, and the list constructor (when the input iterable
  length is not known).

- Changed the internal list over-allocation scheme.  For larger lists,
  overallocation ranged between 3% and 25%.  Now, it is a constant 12%.
  For smaller lists (n<8), overallocation was up to eight elements.  Now,
  the overallocation is no more than three elements -- this improves space
  utilization for applications that have large numbers of small lists.

- Most list bodies now get re-used rather than freed.  Speeds up list
  instantiation and deletion by saving calls to malloc() and free().

- The dict.update() method now accepts all the same argument forms
  as the dict() constructor.  This now includes item lists and/or
  keyword arguments.
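
A sketch of the argument forms now accepted; the keys are arbitrary:

```python
settings = {"debug": False}
settings.update([("level", 1), ("name", "app")])   # list of (key, value) items
settings.update(debug=True)                        # keyword arguments
settings.update({"level": 2}, name="service")      # mapping plus keywords
```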

- Support for arbitrary objects supporting the read-only buffer
  interface as the co_code field of code objects (something that was
  only possible to create from C code) has been removed.

- Made omitted callback and None equivalent for weakref.ref() and
  weakref.proxy(); the None case wasn't handled correctly in all
  cases.

- Fixed problem where PyWeakref_NewRef() and PyWeakref_NewProxy()
  assumed that initial existing entries in an object's weakref list
  would not be removed while allocating a new weakref object.  Since
  GC could be invoked at that time, however, that assumption was
  invalid.  In a truly obscure case of GC being triggered during
  creation of a new weakref object for a referent which already
  has a weakref without a callback which is only referenced from
  cyclic trash, a memory error could occur.  This consistently created a
  segfault in a debug build, but provided less predictable behavior in
  a release build.

- input() builtin function now respects compiler flags such as
  __future__ statements.  SF patch 876178.

- Removed PendingDeprecationWarning from apply().  apply() remains
  deprecated, but the nuisance warning will not be issued.

- At Python shutdown time (Py_Finalize()), 2.3 called cyclic garbage
  collection twice, both before and after tearing down modules.  The
  call after tearing down modules has been disabled, because too much
  of Python has been torn down then for __del__ methods and weakref
  callbacks to execute sanely.  The most common symptom was a sequence
  of uninformative messages on stderr when Python shut down, produced
  by threads trying to raise exceptions, but unable to report the nature
  of their problems because too much of the sys module had already been
  destroyed.

- Removed FutureWarnings related to hex/oct literals and conversions
  and left shifts.  (Thanks to Kalle Svensson for SF patch 849227.)
  This addresses most of the remaining semantic changes promised by
  PEP 237, except for repr() of a long, which still shows the trailing
  'L'.  The PEP appears to promise warnings for operations that
  changed semantics compared to Python 2.3, but this is not
  implemented; we've suffered through enough warnings related to
  hex/oct literals and I think it's best to be silent now.

- For str and unicode objects, the ljust(), center(), and rjust()
  methods now accept an optional argument specifying a fill
  character other than a space.
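
A sketch of the optional fill character with each of the three methods:

```python
invoice = "42".rjust(6, "0")        # pad on the left
title = " NEWS ".center(12, "=")    # pad on both sides
entry = "Core".ljust(8, ".")        # pad on the right
```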

- When method objects have an attribute that can be satisfied either
  by the function object or by the method object, the function
  object's attribute usually wins.  Christian Tismer pointed out that
  this is really a mistake, because this only happens for special
  methods (like __reduce__) where the method object's version is
  really more appropriate than the function's attribute.  So from now
  on, all method attributes will have precedence over function
  attributes with the same name.

- Critical bugfix, for SF bug 839548:  if a weakref with a callback,
  its callback, and its weakly referenced object, all became part of
  cyclic garbage during a single run of garbage collection, the order
  in which they were torn down was unpredictable.  It was possible for
  the callback to see partially-torn-down objects, leading to immediate
  segfaults, or, if the callback resurrected garbage objects, to
  resurrect insane objects that caused segfaults (or other surprises)
  later.  In one sense this wasn't surprising, because Python's cyclic gc
  had no knowledge of Python's weakref objects.  It does now.  When
  weakrefs with callbacks become part of cyclic garbage now, those
  weakrefs are cleared first.  The callbacks don't trigger then,
  preventing the problems.  If you need callbacks to trigger, then just
  as when cyclic gc is not involved, you need to write your code so
  that weakref objects outlive the objects they weakly reference.

- Critical bugfix, for SF bug 840829:  if cyclic garbage collection
  happened to occur during a weakref callback for a new-style class
  instance, subtle memory corruption was the result (in a release build;
  in a debug build, a segfault occurred reliably very soon after).
  This has been repaired.

- Compiler flags set in PYTHONSTARTUP are now active in __main__.

- Added two builtin types, set() and frozenset().
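
A minimal sketch of the two types.  set is mutable; frozenset is
immutable and hashable, so it can serve as a dict key:

```python
seen = set(["a", "b"])
seen.add("c")
common = seen & set(["b", "c", "d"])     # intersection

key = frozenset(["x", "y"])
index = {key: "hashable"}
```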

- Added a reversed() builtin function that returns a reverse iterator
  over a sequence.
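
A brief sketch.  reversed() returns an iterator, so the examples consume
it explicitly:

```python
countdown = list(reversed([1, 2, 3]))
backwards = "".join(reversed("abc"))
```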

- Added a sorted() builtin function that returns a new sorted list
  from any iterable.

- CObjects are now mutable (on the C level) through PyCObject_SetVoidPtr.

- list.sort() now supports three keyword arguments:  cmp, key, and reverse.
  The key argument can be a function of one argument that extracts a
  comparison key from the original record:  mylist.sort(key=str.lower).
  The reverse argument is a boolean value and if True will change the
  sort order as if the comparison arguments were reversed.  In addition,
  the documentation has been amended to guarantee that all sorts
  starting with Py2.3 are stable (the relative order of
  records with equal keys is unchanged).

- Added test whether wchar_t is signed or not. A signed wchar_t is not
  usable as internal unicode type base for Py_UNICODE since the
  unicode implementation assumes an unsigned type.

- Fixed a bug in the cache of length-one Unicode strings that could
  lead to a seg fault.  The specific problem occurred when an earlier,
  non-fatal error left an uninitialized Unicode object in the
  freelist.

- The % formatting operator now supports '%F' which is equivalent to
  '%f'.  This has always been documented but never implemented.

- complex(obj) could leak a little memory if obj wasn't a string or
  number.

- zip() with no arguments now returns an empty list instead of raising
  a TypeError exception.

- obj.__contains__() now returns True/False instead of 1/0.  SF patch
  820195.

- Python no longer tries to be smart about recursive comparisons.
  When comparing containers with cyclic references to themselves it
  will now just hit the recursion limit.  See SF patch 825639.

- str and unicode builtin types now have an rsplit() method that is the
  same as split() except that it scans the string from the end
  working towards the beginning.  See SF feature request 801847.

- Fixed a bug in object.__reduce_ex__ when using protocol 2.  Failure
  to clear the error when attempts to get the __getstate__ attribute
  fail caused intermittent errors and odd behavior.

- buffer objects based on other objects no longer cache a pointer to
  the data and the data length.  Instead, the appropriate tp_as_buffer
  method is called as necessary.

- fixed: if a file is opened with an explicit buffer size >= 1, repeated
  close() calls would attempt to free() the buffer already free()ed on
  the first call.
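
Several of the builtin additions listed in this section can be tried
together.  The sketch below is illustrative only: the sample data is
invented, and it is limited to features that behave the same way on current
Python interpreters (so the cmp argument and the list-returning zip() are
left out).

```python
# Illustrative data only; none of these names come from the NEWS entries.
names = ["bob", "Alice", "carol", "Dave"]

# sorted() builds a new list from any iterable; the input is not modified.
by_default = sorted(names)
by_lower = sorted(names, key=str.lower)

# list.sort() works in place and takes key and reverse as keyword arguments.
# The sort is stable, so records with equal keys keep their original order.
records = [("pear", 3), ("apple", 3), ("fig", 1)]
records.sort(key=lambda rec: rec[1], reverse=True)

# reversed() returns an iterator over a sequence in reverse order.
countdown = list(reversed([1, 2, 3]))

# rsplit() splits starting from the right-hand end of the string.
stem, extension = "archive.tar.gz".rsplit(".", 1)

# set() is mutable; frozenset() is hashable and can be used as a dict key.
letters = set("abca")
letters.add("d")
lookup = {frozenset(["x", "y"]): "pair"}

# __contains__() returns a real boolean rather than 1 or 0.
found = [1, 2, 3].__contains__(2)
```

Passing key=str.lower is usually simpler than writing a comparison
function, because the key is computed once per element.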


Extension modules
-----------------

- Added socket.getservbyport(), and make the second argument in
  getservbyname() and getservbyport() optional.

- time module code that deals with input POSIX timestamps will now raise
  ValueError if more than a second is lost in precision when the
  timestamp is cast to the platform C time_t type.  There's no chance
  that the platform will do anything sensible with the result in such
  cases.  This includes ctime(), localtime() and gmtime().  Assorted
  fromtimestamp() and utcfromtimestamp() methods in the datetime module
  were also protected.  Closes bugs #919012 and 975996.

- fcntl.ioctl now warns if the mutate flag is not specified.

- nt now properly allows referring to UNC roots, e.g. in nt.stat().

- the weakref module now supports additional objects:  array.array,
  sre.pattern_objects, file objects, and sockets.

- operator.isMappingType() and operator.isSequenceType() now give
  fewer false positives.

- socket.sslerror is now a subclass of socket.error.  Also added
  socket.error to the socket module's C API.

- Bug #920575: A problem where the _locale module segfaults on
  nl_langinfo(ERA) caused by GNU libc's illegal NULL return is fixed.

- array objects now support the copy module.  Also, their resizing
  scheme has been updated to match that used for list objects.  This improves
  the performance (speed and memory usage) of append() operations.
  Also, array.array() and array.extend() now accept any iterable argument
  for repeated appends without needing to create another temporary array.

- cStringIO.writelines() now accepts any iterable argument and writes
  the lines one at a time rather than joining them and writing once.
  Made a parallel change to StringIO.writelines().  Saves memory and
  makes writelines() suitable for use with generator expressions.

- time.strftime() now checks that the values in its time tuple argument
  are within the proper boundaries to prevent possible crashes from the
  platform's C library implementation of strftime().  Can possibly
  break code that uses values outside the range that didn't cause
  problems previously (such as setting day of year to 0).  Fixes bug
  #897625.

- The socket module now supports Bluetooth sockets, if the
  system has <bluetooth/bluetooth.h>.

- Added a collections module containing a new datatype, deque(),
  offering high-performance, thread-safe, memory friendly appends
  and pops on either side of the deque.

- Several modules now take advantage of collections.deque() for
  improved performance:  Queue, mutex, shlex, threading, and pydoc.

- The operator module has two new functions, attrgetter() and
  itemgetter() which are useful for creating fast data extractor
  functions for map(), list.sort(), itertools.groupby(), and
  other functions that expect a function argument.

- socket.SHUT_{RD,WR,RDWR} was added.

- os.getsid was added.

- The pwd module incorrectly advertised its struct type as
  struct_pwent; this has been renamed to struct_passwd.  (The old name
  is still supported for backwards compatibility.)

- The xml.parsers.expat module now provides Expat 1.95.7.

- socket.IPPROTO_IPV6 was added.

- readline.clear_history was added.

- select.select() now accepts sequences for its first three arguments.

- cStringIO now supports the f.closed attribute.

- The signal module now exposes SIGRTMIN and SIGRTMAX (if available).

- curses module now supports use_default_colors().  [patch #739124]

- Bug #811028: ncurses.h breakage on FreeBSD/MacOS X

- Bug #814613: INET_ADDRSTRLEN fix needed for all compilers on SGI

- Implemented non-recursive SRE matching scheme (#757624).

- Implemented (?(id/name)yes|no) support in SRE (#572936).

- random.seed() with no arguments or None uses time.time() as a default
  seed.  Modified to match Py2.2 behavior and use fractional seconds so
  that successive runs are more likely to produce different sequences.

- random.Random has a new method, getrandbits(k), which returns an int
  with k random bits.  This method is now an optional part of the API
  for user defined generators.  Any generator that defines getrandbits()
  can now use randrange() for ranges with a length >= 2**53.  Formerly,
  randrange would return only even numbers for ranges that large (see
  SF bug #812202).  Generators that do not define getrandbits() now
  issue a warning when randrange() is called with a range that large.

- itertools has a new function, groupby() for aggregating iterables
  into groups sharing the same key (as determined by a key function).
  It offers some of the functionality of SQL's groupby keyword and of
  the Unix uniq filter.

- itertools now has a new tee() function which produces two independent
  iterators from a single iterable.

- itertools.izip() with no arguments now returns an empty iterator instead
  of raising a TypeError exception.

- Fixed #853061: allow BZ2Compressor.compress() to receive an empty string
  as parameter.
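
The collections, operator, itertools and random entries in this section
describe small building blocks that are easiest to understand side by side.
The following is a minimal sketch with made-up data, not a definitive usage
guide; it avoids izip(), which no longer exists under that name on current
interpreters.

```python
import random
from collections import deque
from itertools import groupby, tee
from operator import attrgetter, itemgetter

# deque supports cheap appends and pops at both ends.
queue = deque([2, 3])
queue.appendleft(1)
queue.append(4)
first = queue.popleft()
last = queue.pop()

# itemgetter() builds a fast extractor that can serve as a sort key.
rows = [("fig", 1), ("apple", 3), ("pear", 3), ("kiwi", 1)]
rows.sort(key=itemgetter(1))

# groupby() groups consecutive items that share a key, so sort first.
grouped = [(count, [name for name, _ in group])
           for count, group in groupby(rows, key=itemgetter(1))]

# attrgetter() does the same for attribute access.
get_real = attrgetter("real")
real_parts = [get_real(z) for z in (1 + 2j, 3 - 4j)]

# tee() produces two independent iterators from a single iterable.
left, right = tee(iter([10, 20, 30]))
left_items = list(left)
right_items = list(right)

# getrandbits(k) returns an integer with k random bits.
bits = random.getrandbits(16)
```

Because groupby() only merges adjacent items, skipping the sort would yield
one group per run of equal keys rather than one group per key.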

Library
-------

- Added a new module: cProfile, a C profiler with the same interface as the
  profile module.  cProfile avoids some of the drawbacks of the hotshot
  profiler and provides a bit more information than the other two profilers.
  Based on "lsprof" (patch #1212837).

- Bug #1266283: The new function "lexists" is now in os.path.__all__.

- Bug #981530: Fix UnboundLocalError in shutil.rmtree().  This affects
  the documented behavior: the function passed to the onerror()
  handler can now also be os.listdir.

- Bug #754449: threading.Thread objects no longer mask exceptions raised during
  interpreter shutdown with another exception from attempting to handle the
  original exception.

- Added decimal.py per PEP 327.

- Bug #981299: rsync is now a recognized protocol in urlparse that uses a
  "netloc" portion of a URL.

- Bug #919012: shutil.move() will not try to move a directory into itself.
  Thanks Johannes Gijsbers.

- Bug #934282: pydoc.stripid() is now case-insensitive.  Thanks Robin Becker.

- Bug #823209:  cmath.log() now takes an optional base argument so that its
  API matches math.log().

- Bug #957381: distutils bdist_rpm no longer fails on recent RPM versions
  that generate a -debuginfo.rpm

- os.path.devnull has been added for all supported platforms.

- Fixed #877165: distutils now picks the right C++ compiler command
  on cygwin and mingw32.

- urllib.urlopen().readline() now handles HTTP/0.9 correctly.

- refactored site.py into functions.  Also wrote regression tests for the
  module.

- The distutils install command now supports the --home option and
  installation scheme for all platforms.

- asyncore.loop now has a repeat count parameter that defaults to
  looping forever.

- The distutils sdist command now ignores all .svn directories, in
  addition to CVS and RCS directories.  .svn directories hold
  administrative files for the Subversion source control system.

- Added a new module: cookielib.  Automatic cookie handling for HTTP
  clients.  Also, support for cookielib has been added to urllib2, so
  urllib2.urlopen() can transparently handle cookies.

- stringprep.py now uses built-in set() instead of sets.Set().

- Bug #876278: Unbounded recursion in modulefinder

- Bug #780300: Swap public and system ID in LexicalHandler.startDTD.
  Applications relying on the wrong order need to be corrected.

- Bug #926075: Fixed a bug that returns a wrong pattern object
  for a string or unicode object in sre.compile() when a different
  type pattern with the same value exists.

- Added countcallers arg to trace.Trace class (--trackcalls command line arg
  when run from the command prompt).

- Fixed a caching bug in platform.platform() where the argument of 'terse' was
  not taken into consideration when caching value.

- Added two new command-line arguments for profile (output file and
  default sort).

- Added global runctx function to profile module

- Add hlist missing entryconfigure and entrycget methods.

- The ptcp154 codec was added for Kazakh character set support.

- Support non-anonymous ftp URLs in urllib2.

- The encodings package will now apply codec name aliases
  first before starting to try the import of the codec module.
  This simplifies overriding built-in codecs with external
  packages, e.g. the included CJK codecs with the JapaneseCodecs
  package, by adjusting the aliases dictionary in encodings.aliases
  accordingly.

- base64 now supports RFC 3548 Base16, Base32, and Base64 encoding and
  decoding standards.

- urllib2 now supports processors.  A processor is a handler that
  implements an xxx_request or xxx_response method.  These methods are
  called for all requests.

- distutils compilers now compile source files in the same order as
  they are passed to the compiler.

- pprint.pprint() and pprint.pformat() now have additional parameters
  indent, width and depth.

- Patch #750542: pprint now will pretty print subclasses of list, tuple
  and dict too, as long as they don't overwrite __repr__().

- Bug #848614: distutils' msvccompiler fails to find the MSVC6
  compiler because of incomplete registry entries.

- httplib.HTTP.putrequest now offers to omit the implicit Accept-Encoding.

- Patch #841977: modulefinder didn't find extension modules in packages

- imaplib.IMAP4.thread was added.

- Plugged a minor hole in tempfile.mktemp() due to the use of
  os.path.exists(), switched to using os.lstat() directly if possible.

- bisect.py and heapq.py now have underlying C implementations
  for better performance.

- heapq.py has two new functions, nsmallest() and nlargest().

- traceback.format_exc has been added (similar to print_exc but it returns
  a string).

- xmlrpclib.MultiCall has been added.

- poplib.POP3_SSL has been added.

- tempfile.mkstemp now returns an absolute path even if dir is relative.

- urlparse is RFC 2396 compliant.

- The fieldnames argument to the csv module's DictReader constructor is now
  optional.  If omitted, the first row of the file will be used as the
  list of fieldnames.

- encodings.bz2_codec was added for access to bz2 compression
  using "a long string".encode('bz2')

- Various improvements to unittest.py, realigned with PyUnit CVS.

- dircache now passes exceptions to the caller, instead of returning
  empty lists.

- The bsddb module and dbhash module now support the iterator and
  mapping protocols which make them more substitutable for dictionaries
  and shelves.

- The csv module's DictReader and DictWriter classes now accept keyword
  arguments.  This was an omission in the initial implementation.

- The email package handles some RFC 2231 parameters with missing
  CHARSET fields better.  It also includes a patch to parameter
  parsing when semicolons appear inside quotes.

- sets.py now runs under Py2.2.  In addition, the argument restrictions
  for most set methods (but not the operators) have been relaxed to
  allow any iterable.

- _strptime.py now has a behind-the-scenes caching mechanism for the most
  recent TimeRE instance used along with the last five unique directive
  patterns.  The overall module was also made more thread-safe.

- random.cunifvariate() and random.stdgamma() were deprecated in Py2.3
  and removed in Py2.4.

- Bug #823328: urllib2.py's HTTP Digest Auth support works again.

- Patch #873597: CJK codecs are imported into rank of default codecs.
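
A handful of the library additions above (heapq.nsmallest() and nlargest(),
the optional fieldnames argument of csv.DictReader, traceback.format_exc(),
the RFC 3548 functions in base64, and decimal) can be exercised in a few
lines.  This is a sketch under assumptions: it is written for a current
Python 3 interpreter, so io.StringIO and bytes literals stand in for the
2.4-era spellings, and all data is invented.

```python
import base64
import csv
import heapq
import io
import traceback
from decimal import Decimal

# nlargest()/nsmallest() pick the extremes without sorting everything.
scores = [41, 7, 93, 18, 62]
top_two = heapq.nlargest(2, scores)
bottom_two = heapq.nsmallest(2, scores)

# With fieldnames omitted, DictReader takes them from the first row.
reader = csv.DictReader(io.StringIO("name,qty\nfig,1\npear,3\n"))
rows = [dict(row) for row in reader]

# format_exc() returns the traceback text instead of printing it.
try:
    1 / 0
except ZeroDivisionError:
    report = traceback.format_exc()

# Base16 and Base32 sit alongside the familiar Base64 functions.
b16 = base64.b16encode(b"hi")
b32 = base64.b32encode(b"hi")
round_trip = base64.b32decode(b32)

# Decimal keeps exact decimal arithmetic, unlike binary floats.
total = Decimal("0.10") + Decimal("0.20")
```

format_exc() is handy when the traceback needs to go to a log or a network
peer rather than to stderr.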

Tools/Demos
-----------

- A hotshotmain script was added to the Tools/scripts directory that
  makes it easy to run a script under control of the hotshot profiler.

- The db2pickle and pickle2db scripts can now dump/load gdbm files.

- The file order on the command line of the pickle2db script was reversed.
  It is now [ picklefile ] dbfile.  This provides better symmetry with
  db2pickle.  The file arguments to both scripts are now source followed by
  destination in situations where both files are given.

- The pydoc script will display a link to the module documentation for
  modules determined to be part of the core distribution.  The documentation
  base directory defaults to http://www.python.org/doc/current/lib/ but can
  be changed by setting the PYTHONDOCS environment variable.

- texcheck.py now detects double word errors.

- md5sum.py mistakenly opened input files in text mode by default, a
  silent and dangerous change from previous releases.  It once again
  opens input files in binary mode by default.  The -t and -b flags
  remain for compatibility with the 2.3 release, but -b is the default
  now.

- py-electric-colon now works when pending-delete/delete-selection mode is
  in effect

- py-help-at-point is no longer bound to the F1 key - it's still bound to
  C-c C-h

- Pynche was fixed to not crash when there is no ~/.pynche file and no
  -d option was given.

Build
-----

- Bug #978645: Modules/getpath.c now builds properly in --disable-framework
  build under OS X.

- Profiling using gprof is now available if Python is configured with
  --enable-profiling.

- Profiling the VM using the Pentium TSC is now possible if Python
  is configured --with-tsc.

- In order to find libraries, setup.py now also looks in /lib64, for use
  on AMD64.

- Bug #934635: Fixed a bug where the configure script couldn't detect
  getaddrinfo() properly if the KAME stack had SCTP support.

- Support for missing ANSI C header files (limits.h, stddef.h, etc) was
  removed.

- Systems requiring the D4, D6 or D7 variants of pthreads are no longer
  supported (see PEP 11).

- Universal newline support can no longer be disabled (see PEP 11).

- Support for DGUX, SunOS 4, IRIX 4 and Minix was removed (see PEP 11).

- Support for systems requiring --with-dl-dld or --with-sgi-dl was removed
  (see PEP 11).

- Tests for sizeof(char) were removed since ANSI C mandates that
  sizeof(char) must be 1.

C API
-----

- Thanks to Anthony Tuininga, the datetime module now supplies a C API
  containing type-check macros and constructors.  See new docs in the
  Python/C API Reference Manual for details.

- Private function _PyTime_DoubleToTimet added, to convert a Python
  timestamp (C double) to platform time_t with some out-of-bounds
  checking.  Declared in new header file timefuncs.h.  It would be
  good to expose some other internal timemodule.c functions there.

- New public functions PyEval_EvaluateFrame and PyGen_New to expose
  generator objects.

- New public functions Py_IncRef() and Py_DecRef(), exposing the
  functionality of the Py_XINCREF() and Py_XDECREF() macros. Useful for
  runtime dynamic embedding of Python.  See patch #938302, by Bob
  Ippolito.

- Added a new macro, PySequence_Fast_ITEMS, which retrieves a fast sequence's
  underlying array of PyObject pointers.  Useful for high speed looping.

- Created a new method flag, METH_COEXIST, which causes a method to be loaded
  even if already defined by a slot wrapper.  This allows a __contains__
  method, for example, to co-exist with a defined sq_contains slot.  This
  is helpful because the PyCFunction can take advantage of optimized calls
  whenever METH_O or METH_NOARGS flags are defined.

- Added a new function, PyDict_Contains(d, k) which is like
  PySequence_Contains() but is specific to dictionaries and executes
  about 10% faster.

- Added three new macros: Py_RETURN_NONE, Py_RETURN_TRUE, and Py_RETURN_FALSE.
  Each returns the singleton it names after Py_INCREF()ing it.

- Added a new function, PyTuple_Pack(n, ...) for constructing tuples from a
  variable length argument list of Python objects without having to invoke
  the more complex machinery of Py_BuildValue().  PyTuple_Pack(3, a, b, c)
  is equivalent to Py_BuildValue("(OOO)", a, b, c).

Windows
-------

- The _winreg module could segfault when reading very large registry
  values, due to unchecked alloca() calls (SF bug 851056).  The fix
  uses either PyMem_Malloc(n) or PyString_FromStringAndSize(NULL, n),
  as appropriate, followed by a size check.

- file.truncate() could misbehave if the file was open for update
  (modes r+, rb+, w+, wb+), and the most recent file operation before
  the truncate() call was an input operation.  SF bug 801631.


What's New in Python 2.3 final?
===============================

*Release date: 29-Jul-2003*

IDLE
----

- Bug 778400:  IDLE hangs when selecting "Edit with IDLE" from explorer.
  This was unique to Windows, and was fixed by adding an -n switch to
  the command the Windows installer creates to execute "Edit with IDLE"
  context-menu actions.

- IDLE displays a new message upon startup:  some "personal firewall"
  kinds of programs (for example, ZoneAlarm) open a dialog of their
  own when any program opens a socket.  IDLE does use sockets, talking
  on the computer's internal loopback interface.  This connection is not
  visible on any external interface and no data is sent to or received
  from the Internet.  So, if you get such a dialog when opening IDLE,
  asking whether to let pythonw.exe talk to address 127.0.0.1, say yes,
  and rest assured no communication external to your machine is taking
  place.  If you don't allow it, IDLE won't be able to start.


What's New in Python 2.3 release candidate 2?
=============================================

*Release date: 24-Jul-2003*

Core and builtins
-----------------

- It is now possible to import from zipfiles containing additional
  data bytes before the zip compatible archive.  Zipfiles containing a
  comment at the end are still unsupported.

Extension modules
-----------------

- A longstanding bug in the parser module's initialization could cause
  fatal internal refcount confusion when the module got initialized more
  than once.  This has been fixed.

- Fixed memory leak in pyexpat; using the parser's ParseFile() method
  with open files that aren't instances of the standard file type
  caused an instance of the bound .read() method to be leaked on every
  call.

- Fixed some leaks in the locale module.

Library
-------

- Lib/encodings/rot_13.py when used as a script, now more properly
  uses the first Python interpreter on your path.

- Removed caching of TimeRE (and thus LocaleTime) in _strptime.py to
  fix a locale related bug in the test suite.  Although another patch
  was needed to actually fix the problem, the cache code was not
  restored.

IDLE
----

- Calltips patches.

Build
-----

- For MacOSX, added -mno-fused-madd to BASECFLAGS to fix test_coercion
  on Panther (OSX 10.3).

C API
-----

Windows
-------

- The tempfile module could do insane imports on Windows if PYTHONCASEOK
  was set, making temp file creation impossible.  Repaired.

- Add a patch to workaround pthread_sigmask() bugs in Cygwin.

Mac
---

- Various fixes to pimp.

- Scripts run with pythonw no longer had full window manager access.

- Don't force boot-disk-only install; for reasons unknown it causes
  more problems than it solves.


What's New in Python 2.3 release candidate 1?
=============================================

*Release date: 18-Jul-2003*

Core and builtins
-----------------

- The new function sys.getcheckinterval() returns the last value set
  by sys.setcheckinterval().

- Several bugs in the symbol table phase of the compiler have been
  fixed.  Errors could be lost and compilation could fail without
  reporting an error.  SF patch 763201.

- The interpreter is now more robust about importing the warnings
  module.  In an executable generated by freeze or similar programs,
  earlier versions of 2.3 would fail if the warnings module could
  not be found on the file system.  Fixes SF bug 771097.

- A warning about assignments to module attributes that shadow
  builtins, present in earlier releases of 2.3, has been removed.

- It is no longer possible to create subclasses of builtin types like str
  and tuple that define an itemsize.  Earlier releases of Python 2.3
  allowed this by mistake, leading to crashes and other problems.

- The thread_id is now initialized to 0 in a non-thread build.  SF bug
  770247.

- SF bug 762891: "del p[key]" on proxy object no longer raises SystemError.

Extension modules
-----------------

- weakref.proxy() can now handle "del obj[i]" for proxy objects
  defining __delitem__.  Formerly, it generated a SystemError.

- SSL no longer crashes the interpreter when the remote side disconnects.

- On Unix the mmap module can again be used to map device files.

- time.strptime now exclusively uses the Python implementation
  contained within the _strptime module.

- The print slot of weakref proxy objects was removed, because it was
  not consistent with the object's repr slot.

- The mmap module only checks file size for regular files, not
  character or block devices.  SF patch 708374.

- The cPickle Pickler garbage collection support was fixed to traverse
  the find_class attribute, if present.

- There are several fixes for the bsddb3 wrapper module.

  bsddb3 no longer crashes if an environment is closed before a cursor
  (SF bug 763298).

  The DB and DBEnv set_get_returns_none function was extended to take
  a level instead of a boolean flag.  The new level 2 means that in
  addition, cursor.set()/.get() methods return None instead of raising
  an exception.

  A typo was fixed in DBCursor.join_item(), preventing a crash.
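
The weakref.proxy() entry in this section is easy to check directly.  The
sketch below assumes a hypothetical Registry class, invented purely for
illustration, that defines __delitem__; deleting an item through the proxy
is forwarded to the underlying object.

```python
import weakref


class Registry(object):
    """Hypothetical container used only for this illustration."""

    def __init__(self):
        self.items = {"a": 1, "b": 2}

    def __delitem__(self, key):
        del self.items[key]


registry = Registry()
proxy = weakref.proxy(registry)

# The deletion is forwarded to Registry.__delitem__ via the proxy.
del proxy["a"]
remaining = sorted(registry.items)
```

The proxy holds no strong reference, so registry must stay alive for as
long as the proxy is used.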

Library
-------

- distutils now supports MSVC 7.1

- doctest now examines all docstrings by default.  Previously, it would
  skip over functions with private names (as indicated by the underscore
  naming convention).  The old default created too much of a risk that
  user tests were being skipped inadvertently.  Note, this change could
  break code in the unlikely case that someone had intentionally put
  failing tests in the docstrings of private functions.  The breakage
  is easily fixable by specifying the old behavior when calling testmod()
  or Tester().

- There were several fixes to the way dumbdbms are closed.  It's vital
  that a dumbdbm database be closed properly, else the on-disk data
  and directory files can be left in mutually inconsistent states.
  dumbdbm.py's _Database.__del__() method attempted to close the
  database properly, but a shutdown race in _Database._commit() could
  prevent this from working, so that a program trusting __del__() to
  get the on-disk files in synch could be badly surprised.  The race
  has been repaired.  A sync() method was also added so that shelve
  can guarantee data is written to disk.

  The close() method can now be called more than once without complaint.

- The classes in threading.py are now new-style classes.  That they
  weren't before was an oversight.

- The urllib2 digest authentication handlers now define the correct
  auth_header.  The earlier versions would fail at runtime.

- SF bug 763023: fix uncaught ZeroDivisionError in difflib ratio methods
  when there are no lines.

- SF bug 763637: fix exception in Tkinter with after_cancel
  which could occur with Tk 8.4

- SF bug 770601: CGIHTTPServer.py now passes the entire environment
  to child processes.

- SF bug 765238: add filter to fnmatch's __all__.

- SF bug 748201: make time.strptime() error messages more helpful.

- SF patch 764470: Do not dump the args attribute of a Fault object in
  xmlrpclib.

- SF patch 549151: urllib and urllib2 now redirect POSTs on 301
  responses.

- SF patch 766650: The whichdb module was fixed to recognize dbm files
  generated by gdbm on OS/2 EMX.

- SF bugs 763047 and 763052: fixed a bug where the timezone value was
  left as -1 when ``time.tzname[0] == time.tzname[1] and not time.daylight``
  is true; it should be left as -1 only when time.daylight is true.

- SF bug 764548: re now allows subclasses of str and unicode to be
  used as patterns.

- SF bug 763637: In Tkinter, change after_cancel() to handle tuples
  of varying sizes.  Tk 8.4 returns a different number of values
  than Tk 8.3.

- SF bug 763023: difflib.ratio() did not catch zero division.

- The Queue module now has an __all__ attribute.
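
As a quick check of the two difflib entries above (SF bug 763023), the
sketch below calls the ratio methods on a matcher that has no lines at all.
It is written so that it also runs on current Python releases.  The value
1.0 for two empty inputs is what current difflib returns; this file does
not state the value, so treat it as an assumption about the repaired
behavior.

```python
import difflib

# Two empty inputs: before the fix, the ratio methods divided by the
# combined length (zero) and raised ZeroDivisionError.
matcher = difflib.SequenceMatcher(None, [], [])

ratios = [matcher.ratio(), matcher.quick_ratio(), matcher.real_quick_ratio()]
print(ratios)
```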

Tools/Demos
-----------

- See Lib/idlelib/NEWS.txt for IDLE news.

- SF bug 753592: webchecker/wsgui now handles user supplied directories.

- The trace.py script has been removed.  It is now in the standard library.

Build
-----

- Python now compiles with -fno-strict-aliasing if possible (SF bug 766696).

- The socket module compiles on IRIX 6.5.10.

- An irix64 system is treated the same way as an irix6 system (SF
  patch 764560).

- Several definitions were missing on FreeBSD 5.x unless the
  __BSD_VISIBLE symbol was defined.  configure now defines it as
  needed.

C API
-----

- Unicode objects now support mbcs as a built-in encoding, so the C
  API can use it without deferring to the encodings package.

Windows
-------

- The Windows implementation of PyThread_start_new_thread() never
  checked error returns from Windows functions correctly.  As a result,
  it could claim to start a new thread even when the Microsoft
  _beginthread() function failed (due to "too many threads" -- this is
  on the order of thousands when it happens).  In these cases, the
  Python exception ::

      thread.error: can't start new thread

  is raised now.

- SF bug 766669: Prevent a GPF on interpreter exit when sockets are in
  use.  The interpreter now calls WSACleanup() from Py_Finalize()
  instead of from DLL teardown.

Mac
---

- Bundlebuilder now inherits default values in the right way.  It was
  previously possible for app bundles to get a type of "BNDL" instead
  of "APPL."  Other improvements include a --build-id option to
  specify the CFBundleIdentifier and use of the --python option to set
  the executable in the bundle.

- Fixed two bugs in MacOSX framework handling.

- pythonw did not allow user interaction in 2.3rc1; this has been fixed.

- Python is now compiled with -mno-fused-madd, making all tests pass
  on Panther.

What's New in Python 2.3 beta 2?
================================

*Release date: 29-Jun-2003*

Core and builtins
-----------------

- A program can now set the environment variable PYTHONINSPECT to some
  string value in Python, and cause the interpreter to enter the
  interactive prompt at program exit, as if Python had been invoked
  with the -i option.

- list.index() now accepts optional start and stop arguments.  Similar
  changes were made to UserList.index(). SF feature request 754014.

- SF patch 751998 fixes an unwanted side effect of the previous fix
  for SF bug 742860 (the next item).

- SF bug 742860: "WeakKeyDictionary __delitem__ uses iterkeys".  This
  wasn't threadsafe, was very inefficient (expected time O(len(dict))
  instead of O(1)), and could raise a spurious RuntimeError if another
  thread mutated the dict during __delitem__, or if a comparison function
  mutated it.  It also neglected to raise KeyError when the key wasn't
  present; didn't raise TypeError when the key wasn't of a weakly
  referenceable type; and broke various more-or-less obscure dict
  invariants by using a sequence of equality comparisons over the whole
  set of dict keys instead of computing the key's hash code to narrow
  the search to those keys with the same hash code.  All of these are
  considered to be bugs.  A new implementation of __delitem__ repairs all
  that, but note that fixing these bugs may change visible behavior in
  code relying (whether intentionally or accidentally) on old behavior.

- SF bug 734869: Fixed a compiler bug that caused a fatal error when
  compiling a list comprehension that contained another list comprehension
  embedded in a lambda expression.

- SF bug 705231:  builtin pow() no longer lets the platform C pow()
  raise -1.0 to integer powers, because (at least) glibc gets it wrong
  in some cases.  The result should be -1.0 if the power is odd and 1.0
  if the power is even, and any float with a sufficiently large exponent
  is (mathematically) an exact even integer.

- SF bug 759227: A new-style class that implements __nonzero__() must
  return a bool or int (but not an int subclass) from that method.  This
  matches the restriction on classic classes.

- The encoding attribute has been added for file objects, and set to
  the terminal encoding on Unix and Windows.

- The softspace attribute of file objects became read-only by oversight.
  It's writable again.

- Reverted a 2.3 beta 1 change to iterators for subclasses of list and
  tuple.  By default, the iterators now access data elements directly
  instead of going through __getitem__.  If __getitem__ access is
  preferred, then __iter__ can be overridden.

- SF bug 735247: The staticmethod and super types participate in
  garbage collection. Before this change, it was possible for leaks to
  occur in functions with non-global free variables that used these types.
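
The optional start and stop arguments to list.index() mentioned in this
section can be sketched as follows.  The list contents are made up for
illustration, and the snippet is written so that it also runs on current
Python releases.

```python
letters = ["a", "b", "c", "a", "b", "c"]

first_b = letters.index("b")          # search the whole list
later_b = letters.index("b", 2)       # start searching at position 2
bounded_c = letters.index("c", 0, 3)  # search only positions 0, 1 and 2

# A value outside the searched window raises ValueError, as usual.
try:
    letters.index("a", 1, 3)
    found = True
except ValueError:
    found = False

print(first_b, later_b, bounded_c, found)
```

The window follows slice conventions: start is included and stop is
excluded.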

Extension modules
-----------------

- The socket module has a new exception, socket.timeout, to allow
  timeouts to be handled separately from other socket errors.

- SF bug 751276: cPickle has been fixed to propagate exceptions raised in
  user code.  In earlier versions, cPickle caught and ignored any
  exception when it performed operations that it expected to raise
  specific exceptions like AttributeError.

- cPickle Pickler and Unpickler objects now participate in garbage
  collection.

- mimetools.choose_boundary() could return duplicate strings at times,
  especially likely on Windows.  The strings returned are now guaranteed
  unique within a single program run.

- thread.interrupt_main() raises KeyboardInterrupt in the main thread.
  dummy_thread has also been modified to try to simulate the behavior.

- array.array.insert() now treats negative indices as being relative
  to the end of the array, just like list.insert() does. (SF bug #739313)

- The datetime module classes datetime, time, and timedelta are now
  properly subclassable.

- _tkinter.{get|set}busywaitinterval was added.

- itertools.islice() now accepts stop=None as documented.
  Fixes SF bug #730685.

- The bsddb185 module is built in only one restricted case: when
  /usr/include/db.h exists and defines HASHVERSION to be 2.  This is true
  for many BSD-derived systems.
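
The array.array.insert() change in this section (SF bug #739313) can be
illustrated by running the same call on an array and on a list.  The values
are arbitrary, and the snippet is a small sketch written so that it also
runs on current Python releases.

```python
from array import array

numbers = array("i", [1, 2, 3])
plain = [1, 2, 3]

# A negative index counts from the end, so -1 means "before the last item".
numbers.insert(-1, 9)
plain.insert(-1, 9)

print(numbers.tolist(), plain)
```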


Library
-------

- Some happy doctest extensions from Jim Fulton have been added to
  doctest.py.  These are already being used in Zope3.  The two
  primary ones:

  doctest.debug(module, name) extracts the doctests from the named object
  in the given module, puts them in a temp file, and starts pdb running
  on that file.  This is great when a doctest fails.

  doctest.DocTestSuite(module=None) returns a synthesized unittest
  TestSuite instance, to be run by the unittest framework, which
  runs all the doctests in the module.  This allows writing tests in
  doctest style (which can be clearer and shorter than writing tests
  in unittest style), without losing unittest's powerful testing
  framework features (which doctest lacks).

- For compatibility with doctests created before 2.3, if an expected
  output block consists solely of "1" and the actual output block
  consists solely of "True", it's accepted as a match; similarly
  for "0" and "False".  This is quite un-doctest-like, but is practical.
  The behavior can be disabled by passing the new doctest module
  constant DONT_ACCEPT_TRUE_FOR_1 to the new optionflags optional
  argument.

- ZipFile.testzip() now only traps BadZipfile exceptions.  Previously,
  a bare except caught too much and reported all errors as a problem
  in the archive.

- The logging module now has a new function, makeLogRecord(), which makes
  it easier for LogHandler to interact with DatagramHandler and SocketHandler.

- The cgitb module has been extended to support plain text display (SF patch
  569574).

- A brand new version of IDLE (from the IDLEfork project at
  SourceForge) is now included as Lib/idlelib.  The old Tools/idle is
  no more.

- Added a new module: trace (documentation missing).  This module used
  to be distributed in Tools/scripts.  It uses sys.settrace() to trace
  code execution -- either function calls or individual lines.  It can
  generate tracing output during execution or a post-mortem report of
  code coverage.

- The threading module has new functions settrace() and setprofile()
  that cooperate with the functions of the same name in the sys
  module.  A function registered with the threading module will
  be used for all threads it creates.  The new trace module uses this
  to provide tracing for code running in threads.

- copy.py: applied SF patch 707900, fixing bug 702858, by Steven
  Taschuk.  Copying a new-style class that had a reference to itself
  didn't work.  (The same thing worked fine for old-style classes.)
  Builtin functions are now treated as atomic, fixing bug #746304.

- difflib.py has two new functions:  context_diff() and unified_diff().

- More fixes to urllib (SF 549151): (a) When redirecting, always use
  GET.  This is common practice and more-or-less sanctioned by the
  HTTP standard. (b) Add a handler for 307 redirection, which becomes
  an error for POST, but a regular redirect for GET and HEAD.

- Added optional 'onerror' argument to os.walk(), to control error
  handling.

- inspect.is{method|data}descriptor was added, to allow pydoc to display
  the __doc__ of data descriptors.

- Fixed socket speed loss caused by use of the _socketobject wrapper class
  in socket.py.

- timeit.py now checks the current directory for imports.

- urllib2.py now knows how to order proxy classes, so the user doesn't
  have to insert them in front of other classes, nor do dirty tricks like
  inserting a "dummy" HTTPHandler after a ProxyHandler when building an
  opener with proxy support.

- Iterators have been added for dbm keys.

- random.Random objects can now be pickled.
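
As a small illustration of the random.Random entry above, the sketch below
saves a generator with pickle and checks that the restored copy continues
with the same sequence.  The seed value is arbitrary, and the snippet is
written so that it also runs on current Python releases.

```python
import pickle
import random

generator = random.Random(12345)
generator.random()  # advance the state a little before saving it

# Round-trip the generator through pickle to get an independent copy.
clone = pickle.loads(pickle.dumps(generator))

original_values = [generator.random() for _ in range(3)]
clone_values = [clone.random() for _ in range(3)]
print(original_values == clone_values)
```

Because the copy carries the full internal state, this can be used to
checkpoint a simulation and resume it later with identical random draws.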

Tools/Demos
-----------

- pydoc now offers help on keywords and topics.

- Tools/idle is gone; long live Lib/idlelib.

- diff.py prints file diffs in context, unified, or ndiff formats,
  providing a command line interface to difflib.py.

- texcheck.py is a new script for making a rough validation of Python LaTeX
  files.

Build
-----

- Setting DESTDIR during 'make install' now allows specifying a
  different root directory.

C API
-----

- PyType_Ready():  If a type declares that it participates in gc
  (Py_TPFLAGS_HAVE_GC), and its base class does not, and its base class's
  tp_free slot is the default _PyObject_Del, and type does not define
  a tp_free slot itself, _PyObject_GC_Del is assigned to type->tp_free.
  Previously _PyObject_Del was inherited, which could at best lead to a
  segfault.  In addition, if even after this magic the type's tp_free
  slot is _PyObject_Del or NULL, and the type is a base type
  (Py_TPFLAGS_BASETYPE), TypeError is raised:  since the type is a base
  type, its dealloc function must call type->tp_free, and since the type
  is gc'able, tp_free must not be NULL or _PyObject_Del.

- PyThreadState_SetAsyncExc(): A new API (deliberately accessible only
  from C) to interrupt a thread by sending it an exception.  It is
  intentional that you have to write your own C extension to call it
  from Python.


New platforms
-------------

None this time.

Tests
-----

- test_imp rewritten so that it doesn't raise RuntimeError if run as a
  side effect of being imported ("import test.autotest").

Windows
-------

- The Windows installer ships with Tcl/Tk 8.4.3 (upgraded from 8.4.1).

- The installer always suggested that Python be installed on the C:
  drive, due to a hardcoded "C:" generated by the Wise installation
  wizard.  People with machines where C: is not the system drive
  usually want Python installed on whichever drive is their system drive
  instead.  We removed the hardcoded "C:", and two testers on machines
  where C: is not the system drive report that the installer now
  suggests their system drive.  Note that you can always select the
  directory you want in the "Select Destination Directory" dialog --
  that's what it's for.

Mac
---

- There's a new module called "autoGIL", which offers a mechanism to
  automatically release the Global Interpreter Lock when an event loop
  goes to sleep, allowing other threads to run. It's currently only
  supported on OSX, in the Mach-O version.

- The OSA modules now allow direct access to properties of the
  toplevel application class (in AppleScript terminology).

- The Package Manager can now update itself.

SourceForge Bugs and Patches Applied
------------------------------------

430160, 471893, 501716, 542562, 549151, 569574, 595837, 596434,
598163, 604210, 604716, 610332, 612627, 614770, 620190, 621891,
622042, 639139, 640236, 644345, 649742, 649742, 658233, 660022,
661318, 661676, 662807, 662923, 666219, 672855, 678325, 682347,
683486, 684981, 685773, 686254, 692776, 692959, 693094, 696777,
697989, 700827, 703666, 708495, 708604, 708901, 710733, 711902,
713722, 715782, 718286, 719359, 719367, 723136, 723831, 723962,
724588, 724767, 724767, 725942, 726150, 726446, 726869, 727051,
727719, 727719, 727805, 728277, 728563, 728656, 729096, 729103,
729293, 729297, 729300, 729317, 729395, 729622, 729817, 730170,
730296, 730594, 730685, 730826, 730963, 731209, 731403, 731504,
731514, 731626, 731635, 731643, 731644, 731644, 731689, 732124,
732143, 732234, 732284, 732284, 732479, 732761, 732783, 732951,
733667, 733781, 734118, 734231, 734869, 735051, 735293, 735527,
735613, 735694, 736962, 736962, 737970, 738066, 739313, 740055,
740234, 740301, 741806, 742126, 742741, 742860, 742860, 742911,
744041, 744104, 744238, 744687, 744877, 745055, 745478, 745525,
745620, 746012, 746304, 746366, 746801, 746953, 747348, 747667,
747954, 748846, 748849, 748973, 748975, 749191, 749210, 749759,
749831, 749911, 750008, 750092, 750542, 750595, 751038, 751107,
751276, 751451, 751916, 751941, 751956, 751998, 752671, 753451,
753602, 753617, 753845, 753925, 754014, 754340, 754447, 755031,
755087, 755147, 755245, 755683, 755987, 756032, 756996, 757058,
757229, 757818, 757821, 757822, 758112, 758910, 759227, 759889,
760257, 760703, 760792, 761104, 761337, 761519, 761830, 762455


What's New in Python 2.3 beta 1?
================================

*Release date: 25-Apr-2003*

Core and builtins
-----------------

- New format codes B, H, I, k and K have been implemented for
  PyArg_ParseTuple and Py_BuildValue.

- New builtin function sum(seq, start=0) returns the sum of all the
  items in iterable object seq, plus start (items are normally numbers,
  and cannot be strings).

- bool() called without arguments now returns False rather than
  raising an exception.  This is consistent with calling the
  constructors for the other builtin types -- called without argument
  they all return the false value of that type.  (SF patch #724135)

- In support of PEP 269 (making the pgen parser generator accessible
  from Python), some changes to the pgen code structure were made; a
  few files that used to be linked only with pgen are now linked with
  Python itself.

- The repr() of a weakref object now shows the __name__ attribute of
  the referenced object, if it has one.

- super() no longer ignores data descriptors, except __class__.  See
  the thread started at
  http://mail.python.org/pipermail/python-dev/2003-April/034338.html

- list.insert(i, x) now interprets negative i as it would be
  interpreted by slicing, so negative values count from the end of the
  list.  This was the only place where such an interpretation was not
  placed on a list index.

- range() now works even if the arguments are longs with magnitude
  larger than sys.maxint, as long as the total length of the sequence
  fits.  E.g., range(2**100, 2**101, 2**100) is the following list:
  [1267650600228229401496703205376L].  (SF patch #707427.)

- Some horridly obscure problems were fixed involving interaction
  between garbage collection and old-style classes with "ambitious"
  getattr hooks.  If an old-style instance didn't have a __del__ method,
  but did have a __getattr__ hook, and the instance became reachable
  only from an unreachable cycle, and the hook resurrected or deleted
  unreachable objects when asked to resolve "__del__", anything up to
  a segfault could happen.  That's been repaired.

- dict.pop now takes an optional argument specifying a default
  value to return if the key is not in the dict.  If a default is not
  given and the key is not found, a KeyError will still be raised.
  Parallel changes were made to UserDict.UserDict and UserDict.DictMixin.
  [SF patch #693753] (contributed by Michael Stone.)

- sys.getfilesystemencoding() was added to expose
  Py_FileSystemDefaultEncoding.

- New function sys.exc_clear() clears the current exception.  This is
  rarely needed, but can sometimes be useful to release objects
  referenced by the traceback held in sys.exc_info()[2].  (SF patch
  #693195.)

- On 64-bit systems, a dictionary could contain duplicate long/int keys
  if the key value was larger than 2**32.  See SF bug #689659.

- Fixed SF bug #663074. The codec system was using global static
  variables to store internal data. As a result, any attempts to use the
  unicode system with multiple active interpreters, or successive
  interpreter executions, would fail.

- "%c" % u"a" now returns a unicode string instead of raising a
  TypeError. u"%c" % 0xffffffff now raises an OverflowError instead
  of a ValueError to be consistent with "%c" % 256. See SF patch #710127.
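
Several of the builtin changes listed in this section (sum(), bool() with
no argument, negative indices in list.insert(), and the default argument to
dict.pop()) can be exercised in a few lines.  The names and values below
are made up for illustration, and the snippet is a sketch written so that
it also runs on current Python releases.

```python
# sum() adds the items of an iterable to an optional start value.
total = sum([1, 2, 3], 10)

# bool() with no argument returns False, like int() returning 0.
empty_bool = bool()

# list.insert() with a negative index counts from the end of the list.
items = ["a", "b", "c"]
items.insert(-1, "x")

# dict.pop() returns the default when the key is missing ...
settings = {"debug": 1}
missing = settings.pop("verbose", 0)

# ... and still raises KeyError when no default is given.
try:
    settings.pop("verbose")
    raised = False
except KeyError:
    raised = True

print(total, empty_bool, items, missing, raised)
```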

Extension modules
-----------------

- The socket module now provides the functions inet_pton and inet_ntop
  for converting between string and packed representation of IP
  addresses.  There is also a new module variable, has_ipv6, which is
  True iff the current Python has IPv6 support.  See SF patch #658327.

- Tkinter wrappers around Tcl variables now pass objects directly
  to Tcl, instead of first converting them to strings.

- The .*? pattern in the re module is now special-cased to avoid the
  recursion limit.  (SF patch #720991 -- many thanks to Gary Herron
  and Greg Chapman.)

- New function sys.call_tracing() allows pdb to debug code
  recursively.

- New function gc.get_referents(obj) returns a list of objects
  directly referenced by obj.  In effect, it exposes what the object's
  tp_traverse slot does, and can be helpful when debugging memory
  leaks.

- The iconv module has been removed from this release.

- The platform-independent routines for packing floats in IEEE formats
  (struct.pack's <f, >f, <d, and >d codes; pickle and cPickle's protocol 1
  pickling of floats) ignored that rounding can cause a carry to
  propagate.  The worst consequence was that, in rare cases, <f and >f
  could produce strings that, when unpacked again, were a factor of 2
  away from the original float.  This has been fixed.  See SF bug
  #705836.

- New function time.tzset() provides access to the C library tzset()
  function, if supported.  (SF patch #675422.)

- Using createfilehandler, deletefilehandler, createtimerhandler functions
  on Tkinter.tkinter (_tkinter module) no longer crashes the interpreter.
  See SF bug #692416.

- Modified the fcntl.ioctl() function to allow modification of a passed
  mutable buffer (for details see the reference documentation).

- Made user requested changes to the itertools module.
  Subsumed the times() function into repeat().
  Added chain() and cycle().

- The rotor module is now deprecated; the encryption algorithm it uses
  is not believed to be secure, and including crypto code with Python
  has implications for exporting and importing it in various countries.

- The socket module now always uses the _socketobject wrapper class, even on
  platforms which have dup(2).  The makefile() method is built directly
  on top of the socket without duplicating the file descriptor, allowing
  timeouts to work properly.
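
The itertools changes noted in this section (repeat() taking over from
times(), plus the new chain() and cycle()) can be sketched as below.  The
sample data is arbitrary, islice() is used only to cut the endless cycle()
short, and the snippet is written so that it also runs on current Python
releases.

```python
from itertools import chain, cycle, islice, repeat

# repeat() with a count covers what the removed times() function did.
three_times = list(repeat("x", 3))

# chain() runs through several iterables one after another.
chained = list(chain([1, 2], (3, 4)))

# cycle() never ends on its own, so islice() cuts off the first five items.
cycled = list(islice(cycle("ab"), 5))

print(three_times, chained, cycled)
```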

Library
-------

- New generator function os.walk() is an easy-to-use alternative to
  os.path.walk().  See os module docs for details.  os.path.walk()
  isn't deprecated at this time, but may become deprecated in a
  future release.

- Added new module "platform" which provides a wide range of tools
  for querying platform dependent features.

- netrc now allows ASCII punctuation characters in passwords.

- shelve now supports the optional writeback argument, and exposes
  pickle protocol versions.

- Several methods of nntplib.NNTP have grown an optional file argument
  which specifies a file to which the command's output is diverted
  (already supported by the body() method).  (SF patch #720468)

- The self-documenting XML server library DocXMLRPCServer was added.

- Support for internationalized domain names has been added through
  the 'idna' and 'punycode' encodings, the 'stringprep' module, the
  'mkstringprep' tool, and enhancements to the socket and httplib
  modules.

- htmlentitydefs has two new dictionaries: name2codepoint maps
  HTML entity names to Unicode codepoints (as integers).
  codepoint2name is the reverse mapping. See SF patch #722017.

- pdb has a new command, "debug", which lets you step through
  arbitrary code from the debugger's (pdb) prompt.

- unittest.failUnlessEqual and its equivalent unittest.assertEqual now
  test 'not a == b' rather than 'a != b'.  This gives the desired
  result for classes that define __eq__ without defining __ne__.

- sgmllib now supports SGML marked sections, in particular the
  MS Office extensions.

- The urllib module now offers support for the iterator protocol.
  SF patch 698520 contributed by Brett Cannon.

- New module timeit provides a simple framework for timing the
  execution speed of expressions and statements.

- sets.Set objects now support mixed-type __eq__ and __ne__, instead
  of raising TypeError.  If x is a Set object and y is a non-Set object,
  x == y is False, and x != y is True.  This is akin to the change made
  for mixed-type comparisons of datetime objects in 2.3a2; more info
  about the rationale is in the NEWS entry for that.  See also SF bug
  report <http://www.python.org/sf/693121>.

- On Unix platforms, if os.listdir() is called with a Unicode argument,
  it now returns Unicode strings.  (This behavior was added earlier
  to the Windows NT/2k/XP version of os.listdir().)

- Distutils: both 'py_modules' and 'packages' keywords can now be specified
  in core.setup().  Previously you could supply one or the other, but
  not both of them.  (SF patch #695090 from Bernhard Herzog)

- New csv package makes it easy to read/write CSV files.

- Module shlex has been extended to allow POSIX-like shell parsing,
  including a split() function for easy splitting of quoted strings and
  commands. An iterator interface was also implemented.
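
The shlex additions described above can be sketched as follows.  The
command string and file names are invented for illustration, and the
snippet is written so that it also runs on current Python releases.

```python
import shlex

command = 'cp "my file.txt" /tmp/backup'

# str.split() knows nothing about quoting and breaks the file name apart.
naive = command.split()

# shlex.split() honours shell-style quoting and keeps it as one argument.
parsed = shlex.split(command)

# A shlex object can also be iterated over directly to get its tokens.
tokens = list(shlex.shlex("alpha beta gamma"))

print(naive)
print(parsed)
print(tokens)
```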

Tools/Demos
-----------

- New script combinerefs.py helps analyze new PYTHONDUMPREFS output.
  See the module docstring for details.

Build
-----

- Fix problem building on OSF1 because the compiler only accepted
  preprocessor directives that start in column 1.  (SF bug #691793.)

C API
-----

- Added PyGC_Collect(), equivalent to calling gc.collect().

- PyThreadState_GetDict() was changed not to raise an exception or
  issue a fatal error when no current thread state is available.  This
  makes it possible to print dictionaries when no thread is active.

- LONG_LONG was renamed to PY_LONG_LONG.  Extensions that use this and
  need compatibility with previous versions can use this::

    #ifndef  PY_LONG_LONG
    #define  PY_LONG_LONG  LONG_LONG
    #endif

- Added PyObject_SelfIter() to fill the tp_iter slot for the
  typical case where the method returns its self argument.

- The extended type structure used for heap types (new-style
  classes defined by Python code using a class statement) is now
  exported from object.h as PyHeapTypeObject.  (SF patch #696193.)

New platforms
-------------

None this time.

Tests
-----

- test_timeout now requires -u network to be passed to regrtest to run.
  See SF bug #692988.

Windows
-------

- os.fsync() now exists on Windows, and calls the Microsoft _commit()
  function.

- New function winsound.MessageBeep() wraps the Win32 API
  MessageBeep().
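
A typical use of os.fsync(), which this section reports as newly available
on Windows, is to force a freshly written file out to disk.  The sketch
below uses a temporary scratch file that is purely illustrative, and it is
written so that it also runs on other platforms and on current Python
releases.

```python
import os
import tempfile

# Hypothetical scratch file, used only for this illustration.
handle, path = tempfile.mkstemp()
stream = os.fdopen(handle, "w")
try:
    stream.write("important data\n")
    stream.flush()             # push Python's buffer down to the OS
    os.fsync(stream.fileno())  # ask the OS to commit the file to disk
finally:
    stream.close()

size = os.path.getsize(path)
os.remove(path)
print(size)
```

Calling flush() first matters: os.fsync() works on the file descriptor and
cannot see data still held in the file object's own buffer.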

Mac
---

- os.listdir() now returns Unicode strings on MacOS X when called with
  a Unicode argument. See the general news item under "Library".

- A new method MacOS.WMAvailable() returns true if it is safe to access
  the window manager, false otherwise.

- EasyDialogs dialogs are now movable-modal, and if the application is
  currently in the background they will ask to be moved to the foreground
  before displaying.

- OSA Scripting support has improved a lot, and gensuitemodule.py can now
  be used by mere mortals. The documentation is now also more or less
  complete.

- The IDE (in a framework build) now includes introductory documentation
  in Apple Help Viewer format.


What's New in Python 2.3 alpha 2?
=================================

*Release date: 19-Feb-2003*

Core and builtins
-----------------

- Negative positions returned from PEP 293 error callbacks are now
  treated as being relative to the end of the input string. Positions
  that are out of bounds raise an IndexError.

- sys.path[0] (the directory from which the script is loaded) is now
  turned into an absolute pathname, unless it is the empty string.
  (SF patch #664376.)

- Finally fixed the bug in compile() and exec where a string ending
  with an indented code block but no newline would raise SyntaxError.
  This would have been a four-line change in parsetok.c...  Except
  codeop.py depends on this behavior, so a compilation flag had to be
  invented that causes the tokenizer to revert to the old behavior;
  this required extra changes to 2 .h files, 2 .c files, and 2 .py
  files.  (Fixes SF bug #501622.)

- If a new-style class defines neither __new__ nor __init__, its
  constructor would ignore all arguments.  This is changed now: the
  constructor refuses arguments in this case.  This might break code
  that worked under Python 2.2.  The simplest fix is to add a no-op
  __init__: ``def __init__(self, *args, **kw): pass``.

- Through a bytecode optimizer bug (and I bet you didn't even know
  Python *had* a bytecode optimizer :-), "unsigned" hex/oct constants
  with a leading minus sign would come out with the wrong sign.
  ("Unsigned" hex/oct constants are those with a face value in the
  range sys.maxint+1 through sys.maxint*2+1, inclusive; these have
  always been interpreted as negative numbers through sign folding.)
  E.g. 0xffffffff is -1, and -(0xffffffff) is 1, but -0xffffffff would
  come out as -4294967295.  This was the case in Python 2.2 through
  2.2.2 and 2.3a1, and in Python 2.4 it will once again have that
  value, but according to PEP 237 it really needs to be 1 now.  This
  will be backported to Python 2.2.3 as well.  (SF #660455)

- int(s, base) sometimes sign-folds hex and oct constants; it only
  does this when base is 0 and s.strip() starts with a '0'.  When the
  sign is actually folded, as in int("0xffffffff", 0) on a 32-bit
  machine, which returns -1, a FutureWarning is now issued; in Python
  2.4, this will return 4294967295L, as do int("+0xffffffff", 0) and
  int("0xffffffff", 16) right now.  (PEP 237)

- super(X, x): x may now be a proxy for an X instance, i.e.
  issubclass(x.__class__, X) but not issubclass(type(x), X).

- isinstance(x, X): if X is a new-style class, this is now equivalent
  to issubclass(type(x), X) or issubclass(x.__class__, X).  Previously
  only type(x) was tested.  (For classic classes this was already the
  case.)
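
The rule above can be illustrated with a proxy object. This is only a sketch for a current Python 3 interpreter; ``Real`` and ``Proxy`` are hypothetical names, and overriding ``__class__`` with a property is just one way to build such a proxy.

```python
class Real(object):
    pass


class Proxy(object):
    """Hypothetical stand-in that reports the wrapped object's class."""

    def __init__(self, target):
        self._target = target

    @property
    def __class__(self):
        return type(self._target)


proxy = Proxy(Real())
by_type = issubclass(type(proxy), Real)        # type(proxy) is Proxy
by_class = issubclass(proxy.__class__, Real)   # __class__ is overridden
combined = isinstance(proxy, Real)             # either test may succeed
```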

- compile(), eval() and the exec statement now fully support source code
  passed as unicode strings.

- int subclasses can be initialized with longs if the value fits in an int.
  See SF bug #683467.

- long(string, base) takes time linear in len(string) when base is a power
  of 2 now.  It used to take time quadratic in len(string).

- filter now returns Unicode results for Unicode arguments.

- raw_input can now return Unicode objects.

- List objects' sort() method now accepts None as the comparison function.
  Passing None is semantically identical to calling sort() with no
  arguments.

- Fixed crash when printing a subclass of str and __str__ returned self.
  See SF bug #667147.

- Fixed an invalid RuntimeWarning and an undetected error when trying
  to convert a long integer into a float which couldn't fit.
  See SF bug #676155.

- Function objects now have a __module__ attribute that is bound to
  the name of the module in which the function was defined.  This
  applies for C functions and methods as well as functions and methods
  defined in Python.  This attribute is used by pickle.whichmodule(),
  which changes the behavior of whichmodule slightly.  In Python 2.2
  whichmodule() returns "__main__" for functions that are not defined
  at the top-level of a module (examples: methods, nested functions).
  Now whichmodule() will return the proper module name.

Extension modules
-----------------

- operator.isNumberType() now checks that the object has a nb_int or
  nb_float slot, rather than simply checking whether it has a non-NULL
  tp_as_number pointer.

- The imp module now has ways to acquire and release the "import
  lock": imp.acquire_lock() and imp.release_lock().  Note: this is a
  reentrant lock, so releasing the lock only truly releases it when
  this is the last release_lock() call.  You can check with
  imp.lock_held().  (SF bug #580952 and patch #683257.)

- Change to cPickle to match pickle.py (see below and PEP 307).

- Fix some bugs in the parser module.  SF bug #678518.

- Thanks to Scott David Daniels, a subtle bug in how the zlib
  extension implemented flush() was fixed.  Scott also rewrote the
  zlib test suite using the unittest module.  (SF bug #640230 and
  patch #678531.)

- Added an itertools module containing high speed, memory efficient
  looping constructs inspired by tools from Haskell and SML.
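
A small sketch of the lazy looping style that itertools enables, using only ``count``, ``islice`` and ``takewhile``. The generator ``squares`` is a made-up helper, and the snippet is written for a current Python 3 interpreter.

```python
from itertools import count, islice, takewhile


def squares():
    """Lazily yield 0, 1, 4, 9, ... without ever building a list."""
    for n in count():
        yield n * n


# Both consumers pull only as many items as they need from an endless stream.
first_five = list(islice(squares(), 5))
below_fifty = list(takewhile(lambda s: s < 50, squares()))
```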

- The SSL module now handles sockets with a timeout set correctly (SF
  patch #675750, fixing SF bug #675552).

- os/posixmodule has grown the sysexits.h constants (EX_OK and friends).

- Fixed broken threadstate swap in readline that could cause fatal
  errors when a readline hook was being invoked while a background
  thread was active.  (SF bugs #660476 and #513033.)

- fcntl now exposes the stropts.h I_* constants.

- Fix a crash on Solaris that occurred when calling close() on
  an mmap'ed file which was already closed.  (SF patch #665913)

- Fixed several serious bugs in the zipimport implementation.

- datetime changes:

  The date class is now properly subclassable.  (SF bug #720908)

  The datetime and datetimetz classes have been collapsed into a single
  datetime class, and likewise the time and timetz classes into a single
  time class.  Previously, a datetimetz object with tzinfo=None acted
  exactly like a datetime object, and similarly for timetz.  This wasn't
  enough of a difference to justify distinct classes, and life is simpler
  now.

  today() and now() now round system timestamps to the closest
  microsecond <http://www.python.org/sf/661086>.  This repairs an
  irritation most likely seen on Windows systems.

  In dt.astimezone(tz), if tz.utcoffset(dt) returns a duration,
  ValueError is raised if tz.dst(dt) returns None (2.3a1 treated it
  as 0 instead, but a tzinfo subclass wishing to participate in
  time zone conversion has to take a stand on whether it supports
  DST; if you don't care about DST, then code dst() to return 0 minutes,
  meaning that DST is never in effect).

  The tzinfo methods utcoffset() and dst() must return a timedelta object
  (or None) now.  In 2.3a1 they could also return an int or long, but that
  was an unhelpfully redundant leftover from an earlier version wherein
  they couldn't return a timedelta.  TOOWTDI.
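
The two tzinfo requirements above (return timedelta objects, and take a stand on DST) can be sketched with a fixed-offset zone. ``FixedOffset`` is a hypothetical class written for this note, not the example from the library docs, and the snippet targets a current Python 3 interpreter.

```python
from datetime import datetime, timedelta, tzinfo

ZERO = timedelta(0)


class FixedOffset(tzinfo):
    """Hypothetical zone with a constant UTC offset and no DST."""

    def __init__(self, minutes, name):
        self._offset = timedelta(minutes=minutes)
        self._name = name

    def utcoffset(self, dt):
        return self._offset      # a timedelta, not an int

    def dst(self, dt):
        return ZERO              # "DST is never in effect", not None

    def tzname(self, dt):
        return self._name


utc = FixedOffset(0, "UTC")
est = FixedOffset(-300, "EST")

noon_utc = datetime(2003, 2, 19, 12, 0, tzinfo=utc)
in_est = noon_utc.astimezone(est)   # same instant, different wall clock
```

Because dst() returns a zero timedelta rather than None, astimezone() can carry out the conversion.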

  The example tzinfo class for local time had a bug.  It was replaced
  by a later example coded by Guido.

  datetime.astimezone(tz) no longer raises an exception when the
  input datetime has no UTC equivalent in tz.  For typical "hybrid" time
  zones (a single tzinfo subclass modeling both standard and daylight
  time), this case can arise one hour per year, at the hour daylight time
  ends.  See new docs for details.  In short, the new behavior mimics
  the local wall clock's behavior of repeating an hour in local time.

  dt.astimezone() can no longer be used to convert between naive and aware
  datetime objects.  If you merely want to attach, or remove, a tzinfo
  object, without any conversion of date and time members, use
  dt.replace(tzinfo=whatever) instead, where "whatever" is None or a
  tzinfo subclass instance.

  A new method tzinfo.fromutc(dt) can be overridden in tzinfo subclasses
  to give complete control over how a UTC time is to be converted to
  a local time.  The default astimezone() implementation calls fromutc()
  as its last step, so a tzinfo subclass can affect that too by overriding
  fromutc().  It's expected that the default fromutc() implementation will
  be suitable as-is for "almost all" time zone subclasses, but the
  creativity of political time zone fiddling appears unbounded -- fromutc()
  allows the highly motivated to emulate any scheme expressible in Python.

  datetime.now():  The optional tzinfo argument was undocumented (that's
  repaired), and its name was changed to tz ("tzinfo" is overloaded enough
  already).  With a tz argument, now(tz) used to return the local date
  and time, and attach tz to it, without any conversion of date and time
  members.  This was less than useful.  Now now(tz) returns the current
  date and time as local time in tz's time zone, akin to ::

      tz.fromutc(datetime.utcnow().replace(tzinfo=utc))

  where "utc" is an instance of a tzinfo subclass modeling UTC.  Without
  a tz argument, now() continues to return the current local date and time,
  as a naive datetime object.
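
A sketch of the now(tz) behavior described above, for a current Python 3 interpreter. It assumes ``datetime.timezone``, a later standard-library addition that did not exist in 2.3, purely to avoid writing out a tzinfo subclass; the nine-hour zone is arbitrary.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical fixed-offset zone, nine hours ahead of UTC.
tokyo = timezone(timedelta(hours=9), "JST")

utc_now = datetime.now(timezone.utc)
tokyo_now = datetime.now(tokyo)     # converted to tz's local time, not just tagged

# Both values describe (nearly) the same instant, so subtracting them gives
# a tiny difference even though their wall-clock fields are hours apart.
gap = abs(tokyo_now - utc_now)

naive_now = datetime.now()          # no tz argument: naive local time
```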

  datetime.fromtimestamp():  Like datetime.now() above, this had less than
  useful behavior when the optional tzinfo argument was specified.  See
  also SF bug report <http://www.python.org/sf/660872>.

  date and datetime comparison:  In order to prevent comparison from
  falling back to the default compare-object-addresses strategy, these
  raised TypeError whenever they didn't understand the other object type.
  They still do, except when the other object has a "timetuple" attribute,
  in which case they return NotImplemented now.  This gives other
  datetime objects (e.g., mxDateTime) a chance to intercept the
  comparison.

  date, time, datetime and timedelta comparison:  When the exception
  for mixed-type comparisons in the last paragraph doesn't apply, if
  the comparison is == then False is returned, and if the comparison is
  != then True is returned.  Because dict lookup and the "in" operator
  only invoke __eq__, this allows, for example, ::

      if some_datetime in some_sequence:

  and ::

      some_dict[some_timedelta] = whatever

  to work as expected, without raising TypeError just because the
  sequence is heterogeneous, or the dict has mixed-type keys.  [This
  seems like a good idea to implement for all mixed-type comparisons
  that don't want to allow falling back to address comparison.]
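
The mixed-type comparison rules above, sketched for a current Python 3 interpreter. The sample values are arbitrary.

```python
from datetime import date, datetime, timedelta

stamp = datetime(2003, 2, 19, 12, 0)
mixed = ["text", 42, None, stamp]
found = stamp in mixed                  # == against str/int/None is simply False

lookup = {"name": 1, timedelta(days=1): 2}
value = lookup[timedelta(hours=24)]     # equal timedeltas find the same key

equal_to_string = (date(2003, 2, 19) == "2003-02-19")

try:
    date(2003, 2, 19) < 42              # ordering across types is still refused
    ordering_refused = False
except TypeError:
    ordering_refused = True
```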

  The constructors building a datetime from a timestamp could raise
  ValueError if the platform C localtime()/gmtime() inserted "leap
  seconds".  Leap seconds are ignored now.  On such platforms, it's
  possible to have timestamps that differ by a second, yet where
  datetimes constructed from them are equal.

  The pickle format of date, time and datetime objects has changed
  completely.  The undocumented pickler and unpickler functions no
  longer exist.  The undocumented __setstate__() and __getstate__()
  methods no longer exist either.

Library
-------

- The logging module was updated slightly; the WARN level was renamed
  to WARNING, and the matching function/method warn() to warning().
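
A short sketch of the renamed level and method, for a current Python 3 interpreter. The logger name ``news_demo`` and the format string are arbitrary choices.

```python
import io
import logging

stream = io.StringIO()
logger = logging.getLogger("news_demo")      # hypothetical logger name
logger.setLevel(logging.WARNING)
logger.propagate = False                     # keep output away from the root logger

handler = logging.StreamHandler(stream)
handler.setFormatter(logging.Formatter("%(levelname)s:%(message)s"))
logger.addHandler(handler)

logger.info("not shown, below WARNING")
logger.warning("disk almost full")           # warning(), formerly warn()

output = stream.getvalue()
level_name = logging.getLevelName(logging.WARNING)
```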

- The pickle and cPickle modules were updated with a new pickling
  protocol (documented by pickletools.py, see below) and several
  extensions to the pickle customization API (__reduce__, __setstate__
  etc.).  The copy module now uses more of the pickle customization
  API to copy objects that don't implement __copy__ or __deepcopy__.
  See PEP 307 for details.
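
A sketch of the copy module going through the pickle customization hooks, for a current Python 3 interpreter. ``Account`` and its ``_cache`` attribute are hypothetical; the point is only that copy.copy() honors __getstate__ and __setstate__ when no __copy__ is defined.

```python
import copy


class Account(object):
    """Hypothetical class with a cache that should not be copied."""

    def __init__(self, owner, balance):
        self.owner = owner
        self.balance = balance
        self._cache = {}

    def __getstate__(self):
        state = self.__dict__.copy()
        del state["_cache"]          # drop the transient part
        return state

    def __setstate__(self, state):
        self.__dict__.update(state)
        self._cache = {}             # rebuild it empty


original = Account("guido", 100)
original._cache["report"] = "expensive result"

shallow = copy.copy(original)        # no __copy__, so the pickle hooks are used
deep = copy.deepcopy(original)       # likewise for __deepcopy__
```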

- The distutils "register" command now uses http://www.python.org/pypi
  as the default repository.  (See PEP 301.)

- The platform dependent path related variables sep, altsep, extsep,
  pathsep, curdir, pardir and defpath are now defined in the platform
  dependent path modules (e.g. ntpath.py) rather than os.py, so these
  variables are now available via os.path.  They continue to be
  available from the os module.
  (see <http://www.python.org/sf/680789>).
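
A brief illustration of the path variables being reachable through both os and os.path, and of using the platform-specific path modules directly. Written for a current Python 3 interpreter; the file names are arbitrary.

```python
import ntpath
import os
import posixpath

same_sep = (os.path.sep == os.sep)
same_curdir = (os.path.curdir == os.curdir)

# The platform-specific modules can be imported and used on any platform.
windows_style = ntpath.join("docs", "readme.txt")
posix_style = posixpath.join("docs", "readme.txt")
```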

- array.array was added to the types repr.py knows about (see
  <http://www.python.org/sf/680789>).

- The new pickletools.py contains lots of documentation about pickle
  internals, and supplies some helpers for working with pickles, such as
  a symbolic pickle disassembler.
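
A sketch of the symbolic disassembler in use, for a current Python 3 interpreter. The pickled tuple is arbitrary, and the exact opcode listing may differ between Python versions.

```python
import io
import pickle
import pickletools

payload = pickle.dumps((1, 2), 2)        # protocol 2, the one introduced in 2.3

listing = io.StringIO()
pickletools.dis(payload, out=listing)    # write the disassembly to a buffer
text = listing.getvalue()

# genops() yields (opcode, argument, position) triples for programmatic use.
opcode_names = [opcode.name for opcode, arg, pos in pickletools.genops(payload)]
```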

- xmlrpclib.py now supports the builtin boolean type.

- py_compile has a new 'doraise' flag and a new PyCompileError
  exception.
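
A sketch of the 'doraise' flag, for a current Python 3 interpreter. The broken source text is made up; without doraise=True the error would be reported on stderr instead of being raised.

```python
import os
import py_compile
import tempfile

fd, path = tempfile.mkstemp(suffix=".py")
with os.fdopen(fd, "w") as handle:
    handle.write("def broken(:\n    pass\n")   # deliberate syntax error

try:
    py_compile.compile(path, doraise=True)
    raised = False
except py_compile.PyCompileError:
    raised = True
finally:
    os.remove(path)
```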

- SimpleXMLRPCServer now supports CGI through the CGIXMLRPCRequestHandler
  class.

- The sets module now raises TypeError in __cmp__, to clarify that
  sets are not intended to be three-way-compared; the comparison
  operators are overloaded as subset/superset tests.

- Bastion.py and rexec.py are disabled.  These modules are not safe in
  Python 2.2 or 2.3.

- realpath is now exported when doing ``from posixpath import *``.
  It is also exported for ntpath, macpath, and os2emxpath.
  See SF bug #659228.

- New module tarfile from Lars Gustäbel provides a comprehensive interface
  to tar archive files with transparent gzip and bzip2 compression.
  See SF patch #651082.
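
A sketch of writing and reading a gzip-compressed archive with tarfile, entirely in memory. Written for a current Python 3 interpreter; the member name and contents are arbitrary.

```python
import io
import tarfile

data = b"hello from a tarball\n"
buffer = io.BytesIO()

# Write a gzip-compressed archive into the in-memory buffer.
archive = tarfile.open(fileobj=buffer, mode="w:gz")
info = tarfile.TarInfo("hello.txt")
info.size = len(data)
archive.addfile(info, io.BytesIO(data))
archive.close()

# Read it back; the compression is handled transparently.
buffer.seek(0)
archive = tarfile.open(fileobj=buffer, mode="r:gz")
names = archive.getnames()
content = archive.extractfile("hello.txt").read()
archive.close()
```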

- urlparse can now parse imap:// URLs.  See SF feature request #618024.

- Tkinter.Canvas.scan_dragto() provides an optional parameter to support
  the gain value which is passed to Tk.  SF bug #602259.

- Fix logging.handlers.SysLogHandler protocol when using UNIX domain sockets.
  See SF patch #642974.

- The dospath module was deleted.  Use the ntpath module when manipulating
  DOS paths from other platforms.

Tools/Demos
-----------

- Two new scripts (db2pickle.py and pickle2db.py) were added to the
  Tools/scripts directory to facilitate conversion from the old bsddb module
  to the new one.  While the user-visible API of the new module is
  compatible with the old one, it's likely that the version of the
  underlying database library has changed.  To convert from the old library,
  run the db2pickle.py script using the old version of Python to convert it
  to a pickle file.  After upgrading Python, run the pickle2db.py script
  using the new version of Python to reconstitute your database.  For
  example::

    % python2.2 db2pickle.py -h some.db > some.pickle
    % python2.3 pickle2db.py -h some.db.new < some.pickle

  Run the scripts without any args to get a usage message.


Build
-----

- The audio driver tests (test_ossaudiodev.py and
  test_linuxaudiodev.py) are no longer run by default.  This is
  because they don't always work, depending on your hardware and
  software.  To run these tests, you must use an invocation like ::

    ./python Lib/test/regrtest.py -u audio test_ossaudiodev

- On systems which build using the configure script, compiler flags which
  used to be lumped together using the OPT flag have been split into two
  groups, OPT and BASECFLAGS.  OPT is meant to carry just optimization- and
  debug-related flags like "-g" and "-O3".  BASECFLAGS is meant to carry
  compiler flags that are required to get a clean compile.  On some
  platforms (many Linux flavors in particular) BASECFLAGS will be empty by
  default.  On others, such as Mac OS X and SCO, it will contain required
  flags.  This change allows people building Python to override OPT without
  fear of clobbering compiler flags which are required to get a clean build.

- On Darwin/Mac OS X platforms, /sw/lib and /sw/include are added to the
  relevant search lists in setup.py.  This allows users building Python to
  take advantage of the many packages available from the fink project
  <http://fink.sf.net/>.

- A new Makefile target, scriptsinstall, installs a number of useful scripts
  from the Tools/scripts directory.

C API
-----

- PyEval_GetFrame() is now declared to return a ``PyFrameObject *``
  instead of a plain ``PyObject *``.  (SF patch #686601.)

- PyNumber_Check() now checks that the object has a nb_int or nb_float
  slot, rather than simply checking whether it has a non-NULL
  tp_as_number pointer.

- A C type that inherits from a base type that defines tp_as_buffer
  will now inherit the tp_as_buffer pointer if it doesn't define one.
  (SF #681367)

- The PyArg_Parse functions now issue a DeprecationWarning if a float
  argument is provided when an integer is specified (this affects the 'b',
  'B', 'h', 'H', 'i', and 'l' codes).  Future versions of Python will
  raise a TypeError.

Tests
-----

- Several tests weren't being run from regrtest.py (test_timeout.py,
  test_tarfile.py, test_netrc.py, test_multifile.py,
  test_importhooks.py and test_imp.py).  Now they are.  (Note to
  developers: please read Lib/test/README when creating a new test, to
  make sure to do it right!  All tests need to use either unittest or
  doctest.)

- Added test_posix.py, a test suite for the posix module.

- Added test_hexoct.py, a test suite for hex/oct constant folding.

Windows
-------

- The timeout code for socket connect() didn't work right; this has
  now been fixed.  test_timeout.py should pass (at least most of the
  time).

- distutils' msvccompiler class now passes the preprocessor options to
  the resource compiler.  See SF patch #669198.

- The bsddb module now ships with Sleepycat's 4.1.25.NC, the latest
  release without strong cryptography.

- sys.path[0], if it contains a directory name, is now always an
  absolute pathname. (SF patch #664376.)

- The new logging package is now installed by the Windows installer.  It
  wasn't in 2.3a1 due to oversight.

Mac
---

- There are new dialogs EasyDialogs.AskFileForOpen, AskFileForSave
  and AskFolder. The old macfs.StandardGetFile and friends are deprecated.

- Most of the standard library now uses pathnames or FSRefs in preference
  to FSSpecs, and uses the underlying Carbon.File and Carbon.Folder modules
  instead of macfs. macfs will probably be deprecated in the future.

- Type Carbon.File.FSCatalogInfo and supporting methods have been implemented.
  This also makes macfs.FSSpec.SetDates() work again.

- There is a new module pimp, the package install manager for Python, and
  accompanying applet PackageManager. These allow you to easily download
  and install pretested extension packages either in source or binary
  form. Only in MacPython-OSX.

- Applets are now built with bundlebuilder in MacPython-OSX, which should make
  them more robust and also provides a path towards BuildApplication. The
  downside of this change is that applets can no longer be run from the
  Terminal window; this will hopefully be fixed in 2.3b1.


What's New in Python 2.3 alpha 1?
=================================

*Release date: 31-Dec-2002*

Type/class unification and new-style classes
--------------------------------------------

- One can now assign to __bases__ and __name__ of new-style classes.

- dict() now accepts keyword arguments so that dict(one=1, two=2)
  is the equivalent of {"one": 1, "two": 2}.  Accordingly,
  the existing (but undocumented) 'items' keyword argument has
  been eliminated.  This means that dict(items=someMapping) now has
  a different meaning than before.
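
A short illustration of the keyword form and of the changed meaning of 'items'. The sample keys are arbitrary.

```python
numbers = dict(one=1, two=2)

# 'items' is no longer special: it is just another key.
just_a_key = dict(items=[("a", 1)])
```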

- int() now returns a long object if the argument is outside the
  integer range, so int("4" * 1000), int(1e200) and int(1L<<1000) will
  all return long objects instead of raising an OverflowError.

- Assignment to __class__ is disallowed if either the old or the new
  class is a statically allocated type object (such as defined by an
  extension module).  This prevents anomalies like 2.__class__ = bool.

- New-style object creation and deallocation have been sped up
  significantly; they are now faster than classic instance creation
  and deallocation.

- The __slots__ variable can now mention "private" names, and the
  right thing will happen (e.g. __slots__ = ["__foo"]).
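
A sketch of a "private" slot name, for a current Python 3 interpreter. ``Point`` is a hypothetical class; "the right thing" here means the slot is stored under the usual mangled name.

```python
class Point(object):
    __slots__ = ["__x"]          # stored under the mangled name _Point__x

    def __init__(self, x):
        self.__x = x             # mangled the same way inside the class body

    def x(self):
        return self.__x


p = Point(3)
mangled_present = hasattr(p, "_Point__x")

try:
    p.y = 4                      # no __dict__, so unknown attributes are refused
    extra_allowed = True
except AttributeError:
    extra_allowed = False
```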

- The built-ins slice() and buffer() are now callable types.  The
  types classobj (formerly class), code, function, instance, and
  instancemethod (formerly instance-method), which have no built-in
  names but are accessible through the types module, are now also
  callable.  The type dict-proxy is renamed to dictproxy.

- Cycles going through the __class__ link of a new-style instance are
  now detected by the garbage collector.

- Classes using __slots__ are now properly garbage collected.
  [SF bug 519621]

- Tightened the __slots__ rules: a slot name must be a valid Python
  identifier.

- The constructor for the module type now requires a name argument and
  takes an optional docstring argument.  Previously, this constructor
  ignored its arguments.  As a consequence, deriving a class from a
  module (not from the module type) is now illegal; previously this
  created an unnamed module, just like invoking the module type did.
  [SF bug 563060]

- A new type object, 'basestring', is added.  This is a common base type
  for 'str' and 'unicode', and can be used instead of
  types.StringTypes, e.g. to test whether something is "a string":
  isinstance(x, basestring) is True for Unicode and 8-bit strings.  This
  is an abstract base class and cannot be instantiated directly.

- Changed new-style class instantiation so that when C's __new__
  method returns something that's not a C instance, its __init__ is
  not called.  [SF bug #537450]
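
A sketch of the instantiation rule above, for a current Python 3 interpreter. ``Wrapper`` and the ``calls`` list are made up to make the skipped __init__ observable.

```python
calls = []


class Wrapper(object):
    def __new__(cls, value):
        if value is None:
            return None                  # not a Wrapper instance
        return object.__new__(cls)

    def __init__(self, value):
        calls.append(value)              # record every __init__ call
        self.value = value


nothing = Wrapper(None)   # __new__ returned None, so __init__ is skipped
something = Wrapper(5)    # a real instance, so __init__ runs
```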

- Fixed super() to work correctly with class methods.  [SF bug #535444]

- If you try to pickle an instance of a class that has __slots__ but
  doesn't define or override __getstate__, a TypeError is now raised.
  This is done by adding a bozo __getstate__ to the class that always
  raises TypeError.  (Before, this would appear to be pickled, but the
  state of the slots would be lost.)

Core and builtins
-----------------

- Import from zipfiles is now supported.  The name of a zipfile placed
  on sys.path causes the import statement to look for importable Python
  modules (with .py, .pyc and .pyo extensions) and packages inside the
  zipfile.  The zipfile import follows the specification (though not
  the sample implementation) of PEP 273.  The semantics of __path__ are
  compatible with those that have been implemented in Jython since
  Jython 2.1.
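
A sketch of importing from a zipfile placed on sys.path, for a current Python 3 interpreter. The archive name ``bundle.zip`` and the module name ``news_zip_demo`` are hypothetical, chosen to avoid clashing with anything installed.

```python
import os
import sys
import tempfile
import zipfile

workdir = tempfile.mkdtemp()
archive_path = os.path.join(workdir, "bundle.zip")

# Build a zipfile containing a single one-line module.
archive = zipfile.ZipFile(archive_path, "w")
archive.writestr("news_zip_demo.py", "GREETING = 'hello from a zipfile'\n")
archive.close()

sys.path.insert(0, archive_path)          # the zipfile itself goes on sys.path
try:
    import news_zip_demo
    greeting = news_zip_demo.GREETING
    loaded_from = news_zip_demo.__file__  # a path inside bundle.zip
finally:
    sys.path.remove(archive_path)
```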

- PEP 302 has been accepted.  Although it was initially developed to
  support zipimport, it offers a new, general import hook mechanism.
  Several new variables have been added to the sys module:
  sys.meta_path, sys.path_hooks, and sys.path_importer_cache; these
  make extending the import statement much more convenient than
  overriding the __import__ built-in function.  For a description of
  these, see PEP 302.

- A frame object's f_lineno attribute can now be written to from a
  trace function to change which line will execute next.  A command to
  exploit this from pdb has been added.  [SF patch #643835]

- The _codecs support module for codecs.py was turned into a builtin
  module to assure that at least the builtin codecs are available
  to the Python parser for source code decoding according to PEP 263.

- issubclass now supports a tuple as the second argument, just like
  isinstance does. ``issubclass(X, (A, B))`` is equivalent to
  ``issubclass(X, A) or issubclass(X, B)``.
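
The equivalence can be checked directly. ``Shape`` and ``Circle`` are hypothetical classes.

```python
class Shape(object):
    pass


class Circle(Shape):
    pass


either = issubclass(Circle, (int, Shape))
spelled_out = issubclass(Circle, int) or issubclass(Circle, Shape)
neither = issubclass(Circle, (int, str))
```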

- Thanks to Armin Rigo, the last known way to provoke a system crash
  by cleverly arranging for a comparison function to mutate a list
  during a list.sort() operation has been fixed.  The effect of
  attempting to mutate a list, or even to inspect its contents or
  length, while a sort is in progress, is not defined by the language.
  The C implementation of Python 2.3 attempts to detect mutations,
  and raise ValueError if one occurs, but there's no guarantee that
  all mutations will be caught, or that any will be caught across
  releases or implementations.

- Unicode file name processing for Windows (PEP 277) is implemented.
  All platforms now have an os.path.supports_unicode_filenames attribute,
  which is set to True on Windows NT/2000/XP, and False elsewhere.

- Codec error handling callbacks (PEP 293) are implemented.
  Error handling in unicode.encode or str.decode can now be customized.
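
A sketch of a custom error callback registered with codecs.register_error(), for a current Python 3 interpreter. The handler name ``news-demo-codepoint`` and the ``<U+XXXX>`` output format are invented for this example.

```python
import codecs


def show_codepoint(exc):
    """Hypothetical handler: spell out unencodable characters as <U+XXXX>."""
    if not isinstance(exc, UnicodeEncodeError):
        raise exc
    bad = exc.object[exc.start:exc.end]
    replacement = "".join("<U+%04X>" % ord(ch) for ch in bad)
    return (replacement, exc.end)    # resume encoding right after the bad run


codecs.register_error("news-demo-codepoint", show_codepoint)
encoded = "caf\u00e9!".encode("ascii", "news-demo-codepoint")
```

The callback returns a replacement string and the position at which encoding should resume.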

- A subtle change to the semantics of the built-in function intern():
  interned strings are no longer immortal.  You must keep a reference
  to the return value of intern() around to get the benefit.

- Use of 'None' as a variable, argument or attribute name now
  issues a SyntaxWarning.  In the future, None may become a keyword.

- SET_LINENO is gone.  co_lnotab is now consulted to determine when to
  call the trace function.  C code that accessed f_lineno should call
  PyCode_Addr2Line instead (f_lineno is still there, but only kept up
  to date when there is a trace function set).

- There's a new warning category, FutureWarning.  This is used to warn
  about a number of situations where the value or sign of an integer
  result will change in Python 2.4 as a result of PEP 237 (integer
  unification).  The warnings implement stage B0 mentioned in that
  PEP.  The warnings are about the following situations:

    - Octal and hex literals without 'L' suffix in the inclusive range
      [0x80000000..0xffffffff]; these are currently negative ints, but
      in Python 2.4 they will be positive longs with the same bit
      pattern.

    - Left shifts on integer values that cause the outcome to lose
      bits or have a different sign than the left operand.  To be
      precise: x<<n where this currently doesn't yield the same value
      as long(x)<<n; in Python 2.4, the outcome will be long(x)<<n.

    - Conversions from ints to string that show negative values as
      unsigned ints in the inclusive range [0x80000000..0xffffffff];
      this affects the functions hex() and oct(), and the string
      formatting codes %u, %o, %x, and %X.  In Python 2.4, these will
      show signed values (e.g. hex(-1) currently returns "0xffffffff";
      in Python 2.4 it will return "-0x1").

- The bits manipulated under the cover by sys.setcheckinterval() have
  been changed.  Both the check interval and the ticker used to be
  per-thread values.  They are now just a pair of global variables.
  In addition, the default check interval was boosted from 10 to 100
  bytecode instructions.  This may have some effect on systems that
  relied on the old default value.  In particular, in multi-threaded
  applications which try to be highly responsive, response time will
  increase by some (perhaps imperceptible) amount.

- When multiplying very large integers, a version of the so-called
  Karatsuba algorithm is now used.  This is most effective if the
  inputs have roughly the same size.  If they both have about N digits,
  Karatsuba multiplication has O(N**1.58) runtime (the exponent is
  log_base_2(3)) instead of the previous O(N**2).  Measured results may
  be better or worse than that, depending on platform quirks.  Besides
  the O() improvement in raw instruction count, the Karatsuba algorithm
  appears to have much better cache behavior on extremely large integers
  (starting in the ballpark of a million bits).  Note that this is a
  simple implementation, and there's no intent here to compete with,
  e.g., GMP.  It gives a very nice speedup when it applies, but a package
  devoted to fast large-integer arithmetic should run circles around it.
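
For readers unfamiliar with the idea, here is a pure-Python sketch of Karatsuba multiplication. It is not the interpreter's C implementation; the function name, the 64-bit cutoff and the bit-based splitting are assumptions made for illustration. The point is that three recursive products replace the four of the schoolbook method.

```python
CUTOFF_BITS = 64   # arbitrary threshold below which plain multiplication is used


def karatsuba(x, y):
    """Multiply two non-negative integers with three half-size products."""
    if x < 0 or y < 0:
        raise ValueError("this sketch handles non-negative integers only")
    if x.bit_length() <= CUTOFF_BITS or y.bit_length() <= CUTOFF_BITS:
        return x * y

    # Split each operand into high and low halves around bit position n.
    n = max(x.bit_length(), y.bit_length()) // 2
    mask = (1 << n) - 1
    x_high, x_low = x >> n, x & mask
    y_high, y_low = y >> n, y & mask

    high = karatsuba(x_high, y_high)
    low = karatsuba(x_low, y_low)
    # (x_high + x_low)(y_high + y_low) - high - low == x_high*y_low + x_low*y_high
    middle = karatsuba(x_high + x_low, y_high + y_low) - high - low

    return (high << (2 * n)) + (middle << n) + low


a = 3 ** 500
b = 7 ** 400
product = karatsuba(a, b)
```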

- u'%c' will now raise a ValueError in case the argument is an
  integer outside the valid range of Unicode code point ordinals.

- The tempfile module has been overhauled for enhanced security.  The
  mktemp() function is now deprecated; new, safe replacements are
  mkstemp() (for files) and mkdtemp() (for directories), and the
  higher-level functions NamedTemporaryFile() and TemporaryFile().
  Use of some global variables in this module is also deprecated; the
  new functions have keyword arguments to provide the same
  functionality.  All Lib, Tools and Demo modules that used the unsafe
  interfaces have been updated to use the safe replacements.  Thanks
  to Zack Weinberg!
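
A sketch of the safe replacements for mktemp(), for a current Python 3 interpreter. The suffix, prefix and file contents are arbitrary.

```python
import os
import tempfile

# mkstemp() creates the file and returns an already-open OS-level handle,
# so no other process can claim the name between choosing it and opening it.
fd, path = tempfile.mkstemp(suffix=".txt", prefix="news_demo_")
try:
    with os.fdopen(fd, "w") as handle:
        handle.write("scratch data\n")
    with open(path) as handle:
        saved = handle.read()
finally:
    os.remove(path)              # the caller is responsible for cleanup

# mkdtemp() does the same for directories.
directory = tempfile.mkdtemp()
was_directory = os.path.isdir(directory)
os.rmdir(directory)
```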

- When x is an object whose class implements __mul__ and __rmul__,
  1.0*x would correctly invoke __rmul__, but 1*x would erroneously
  invoke __mul__.  This was due to the sequence-repeat code in the int
  type.  This has been fixed now.
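
The corrected dispatch can be observed with a class that reports which method ran. ``Tagged`` is a hypothetical class.

```python
class Tagged(object):
    def __mul__(self, other):
        return ("__mul__", other)

    def __rmul__(self, other):
        return ("__rmul__", other)


x = Tagged()
left = x * 2            # x is the left operand: __mul__
right_int = 1 * x       # x is the right operand: __rmul__
right_float = 1.0 * x   # likewise
```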

- Previously, "str1 in str2" required str1 to be a string of length 1.
  This restriction has been relaxed to allow str1 to be a string of
  any length.  Thus "'el' in 'hello world'" returns True now.

- File objects are now their own iterators.  For a file f, iter(f) now
  returns f (unless f is closed), and f.next() is similar to
  f.readline() when EOF is not reached; however, f.next() uses a
  readahead buffer that messes up the file position, so mixing
  f.next() and f.readline() (or other methods) doesn't work right.
  Calling f.seek() drops the readahead buffer, but other operations
  don't.  It so happens that this gives a nice additional speed boost
  to "for line in file:"; the xreadlines method and corresponding
  module are now obsolete.  Thanks to Oren Tirosh!
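
A sketch of a file acting as its own iterator, for a current Python 3 interpreter, where the method is reached through the built-in next() rather than f.next(). The file contents are arbitrary.

```python
import os
import tempfile

fd, path = tempfile.mkstemp()
with os.fdopen(fd, "w") as handle:
    handle.write("alpha\nbeta\ngamma\n")

handle = open(path)
is_own_iterator = iter(handle) is handle
first = next(handle)                  # spelled handle.next() in Python 2.3
rest = [line for line in handle]      # iteration resumes after the first line
handle.close()
os.remove(path)
```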

- Encoding declarations (PEP 263, phase 1) have been implemented.  A
  comment of the form "# -*- coding: <encodingname> -*-" in the first
  or second line of a Python source file indicates the encoding.

- list.sort() has a new implementation.  While cross-platform results
  may vary, and in data-dependent ways, this is much faster on many
  kinds of partially ordered lists than the previous implementation,
  and reported to be just as fast on randomly ordered lists on
  several major platforms.  This sort is also stable (if A==B and A
  precedes B in the list at the start, A precedes B after the sort too),
  although the language definition does not guarantee stability.  A
  potential drawback is that list.sort() may require temp space of
  len(list)*2 bytes (``*4`` on a 64-bit machine).  It's therefore possible
  for list.sort() to raise MemoryError now, even if a comparison function
  does not.  See <http://www.python.org/sf/587076> for full details.
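
A small illustration of what stability means in practice. It relies on the ``key`` argument, which was added to list.sort() after this release, so read it as a sketch for current interpreters; the sample records are made up:

```python
records = [("pear", 2), ("apple", 1), ("fig", 2), ("kiwi", 1)]


def count_of(record):
    """Sort key: only the numeric field takes part in the comparison."""
    return record[1]


by_count = list(records)
# Records with equal counts keep their original relative order, because
# the sort is stable: "apple" stays ahead of "kiwi", "pear" ahead of "fig".
by_count.sort(key=count_of)
```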

- All standard iterators now ensure that, once StopIteration has been
  raised, all future calls to next() on the same iterator will also
  raise StopIteration.  There used to be various counterexamples to
  this behavior, which could cause confusion or subtle program
  breakage, without any benefits.  (Note that this is still an
  iterator's responsibility; the iterator framework does not enforce
  this.)
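
Because the guarantee remains the iterator's own responsibility, a hand-written iterator has to provide it itself. One possible shape, using a hypothetical ``Countdown`` class and the ``__next__`` spelling of current interpreters (with a ``next`` alias for the 2.x protocol):

```python
class Countdown(object):
    """Iterator that yields start, start-1, ..., 1 and then stays exhausted."""

    def __init__(self, start):
        self.remaining = start

    def __iter__(self):
        return self

    def __next__(self):
        if self.remaining <= 0:
            # Nothing resets the counter, so every later call lands here
            # and raises StopIteration again.
            raise StopIteration
        value = self.remaining
        self.remaining -= 1
        return value

    next = __next__  # 2.x spelling of the same method


countdown = Countdown(2)
first_pass = list(countdown)
after_exhaustion = [next(countdown, None), next(countdown, None)]
```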

- Ctrl+C handling on Windows has been made more consistent with
  other platforms.  KeyboardInterrupt can now reliably be caught,
  and Ctrl+C at an interactive prompt no longer terminates the
  process under NT/2k/XP (it never did under Win9x).  Ctrl+C will
  interrupt time.sleep() in the main thread, and any child processes
  created via the popen family (on win2k; we can't make win9x work
  reliably) are also interrupted (as generally happens on Linux/Unix).
  [SF bugs 231273, 439992 and 581232]

- sys.getwindowsversion() has been added on Windows.  This
  returns a tuple with information about the version of Windows
  currently running.

- Slices and repetitions of buffer objects now consistently return
  a string.  Formerly, strings would be returned most of the time,
  but a buffer object would be returned when the repetition count
  was one or when the slice range was all inclusive.

- Unicode objects in sys.path are no longer ignored but treated
  as directory names.

- Fixed string.startswith and string.endswith builtin methods
  so they accept negative indices.  [SF bug 493951]
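
For illustration, negative start and end positions are interpreted as in slicing; the sample string is arbitrary:

```python
text = "hello world"

# Examines text[-5:], which is "world".
starts_in_tail = text.startswith("wor", -5)

# Examines text[0:-2], which is "hello wor".
ends_before_tail = text.endswith("wor", 0, -2)

# Examines text[-5:-1], which is "worl", so the full word no longer fits.
ends_with_world = text.endswith("world", -5, -1)
```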

- Fixed a bug with a continue inside a try block and a yield in the
  finally clause.  [SF bug 567538]

- Most builtin sequences now support "extended slices", i.e. slices
  with a third "stride" parameter.  For example, "hello world"[::-1]
  gives "dlrow olleh".

- A new warning PendingDeprecationWarning was added to provide
  direction on features which are in the process of being deprecated.
  The warning will not be printed by default.  To see the pending
  deprecations, use -Walways::PendingDeprecationWarning::
  as a command line option or warnings.filterwarnings() in code.

- Deprecated features of xrange objects have been removed as
  promised.  The start, stop, and step attributes and the tolist()
  method no longer exist.  xrange repetition and slicing have been
  removed.

- New builtin function enumerate(x), from PEP 279.  Example:
  enumerate("abc") is an iterator returning (0,"a"), (1,"b"), (2,"c").
  The argument can be an arbitrary iterable object.

- The assert statement no longer tests __debug__ at runtime.  This means
  that assert statements cannot be disabled by assigning a false value
  to __debug__.

- A method zfill() was added to str and unicode, that fills a numeric
  string to the left with zeros.  For example,
  "+123".zfill(6) -> "+00123".

- Complex numbers supported divmod() and the // and % operators, but
  these make no sense.  Since this was documented, they're being
  deprecated now.

- String and unicode methods lstrip(), rstrip() and strip() now take
  an optional argument that specifies the characters to strip.  For
  example, "Foo!!!?!?!?".rstrip("?!") -> "Foo".

- There's a new dictionary constructor (a class method of the dict
  class), dict.fromkeys(iterable, value=None).  It constructs a
  dictionary with keys taken from the iterable and all values set to a
  single value.  It can be used for building sets and for removing
  duplicates from sequences.
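
A short sketch of both uses mentioned above. The word list is invented, and the result is sorted so that the example does not depend on dictionary ordering:

```python
words = ["spam", "eggs", "spam", "ham", "eggs"]

# Each distinct word becomes a key; every value defaults to None.
seen = dict.fromkeys(words)
unique_words = sorted(seen)

# An explicit second argument gives every key the same starting value.
flags = dict.fromkeys(["a", "b"], 0)
```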

- Added a new dict method pop(key).  This removes and returns the
  value corresponding to key.  [SF patch #539949]

- A new built-in type, bool, has been added, as well as built-in
  names for its two values, True and False.  Comparisons and sundry
  other operations that return a truth value have been changed to
  return a bool instead.  Read PEP 285 for an explanation of why this
  is backward compatible.

- Fixed two bugs reported as SF #535905: under certain conditions,
  deallocating a deeply nested structure could cause a segfault in the
  garbage collector, due to interaction with the "trashcan" code;
  access to the current frame during destruction of a local variable
  could access a pointer to freed memory.

- The optional object allocator ("pymalloc") has been enabled by
  default.  The recommended practice for memory allocation and
  deallocation has been streamlined.  A header file is included,
  Misc/pymemcompat.h, which can be bundled with 3rd party extensions
  and lets them use the same API with Python versions from 1.5.2
  onwards.

- PyErr_Display will provide file and line information for all exceptions
  that have an attribute print_file_and_line, not just SyntaxErrors.

- The UTF-8 codec will now encode and decode Unicode surrogates
  correctly and without raising exceptions for unpaired ones.

- Universal newlines (PEP 278) is implemented.  Briefly, using 'U'
  instead of 'r' when opening a text file for reading changes the line
  ending convention so that any of '\r', '\r\n', and '\n' is
  recognized (even mixed in one file); all three are converted to
  '\n', the standard Python line end character.

- file.xreadlines() now raises a ValueError if the file is closed:
  Previously, an xreadlines object was returned which would raise
  a ValueError when the xreadlines.next() method was called.

- sys.exit() inadvertently allowed more than one argument.
  An exception will now be raised if more than one argument is used.

- Changed evaluation order of dictionary literals to conform to the
  general left to right evaluation order rule. Now {f1(): f2()} will
  evaluate f1 first.

- Fixed bug #521782: when a file was in non-blocking mode, file.read()
  could silently lose data or wrongly throw an unknown error.

- The sq_repeat, sq_inplace_repeat, sq_concat and sq_inplace_concat
  slots are now always tried after trying the corresponding nb_* slots.
  This fixes a number of minor bugs (see bug #624807).

- Fix problem with dynamic loading on 64-bit AIX (see bug #639945).

Extension modules
-----------------

- Added three operators to the operator module:
    operator.pow(a,b) which is equivalent to:  a**b.
    operator.is_(a,b) which is equivalent to:  a is b.
    operator.is_not(a,b) which is equivalent to:  a is not b.
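
The three functions can be exercised directly; the ``sentinel`` object is only a convenient stand-in for an identity test:

```python
import operator

power = operator.pow(2, 10)          # same as 2 ** 10

sentinel = object()
same_object = operator.is_(sentinel, sentinel)          # sentinel is sentinel
different_object = operator.is_not(sentinel, object())  # sentinel is not object()
```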

- posix.openpty now works on all systems that have /dev/ptmx.

- A module zipimport exists to support importing code from zip
  archives.

- The new datetime module supplies classes for manipulating dates and
  times.  The basic design came from the Zope "fishbowl process", and
  favors practical commercial applications over calendar esoterica.  See

      http://www.zope.org/Members/fdrake/DateTimeWiki/FrontPage

- _tkinter now returns Tcl objects, instead of strings. Objects which
  have Python equivalents are converted to Python objects, other objects
  are wrapped. This can be configured through the wantobjects method,
  or Tkinter.wantobjects.

- The PyBSDDB wrapper around the Sleepycat Berkeley DB library has
  been added as the package bsddb.  The traditional bsddb module is
  still available in source code, but not built automatically anymore,
  and is now named bsddb185.  This supports Berkeley DB versions from
  3.0 to 4.1.  For help converting your databases from the old module (which
  probably used an obsolete version of Berkeley DB) to the new module, see
  the db2pickle.py and pickle2db.py scripts described in the Tools/Demos
  section above.

- unicodedata was updated to Unicode 3.2. It supports normalization
  and names for Hangul syllables and CJK unified ideographs.

- resource.getrlimit() now returns longs instead of ints.

- readline now dynamically adjusts its input/output stream if
  sys.stdin/stdout changes.

- The _tkinter module (and hence Tkinter) has dropped support for
  Tcl/Tk 8.0 and 8.1.  Only Tcl/Tk versions 8.2, 8.3 and 8.4 are
  supported.

- cPickle.BadPickleGet is now a class.

- The time stamps in os.stat_result are floating point numbers
  after stat_float_times has been called.

- If the size passed to mmap.mmap() is larger than the length of the
  file on non-Windows platforms, a ValueError is raised. [SF bug 585792]

- The xreadlines module is slated for obsolescence.

- The strptime function in the time module is now always available (a
  Python implementation is used when the C library doesn't define it).

- The 'new' module is no longer an extension, but a Python module that
  only exists for backwards compatibility.  Its contents are no longer
  functions but callable type objects.

- The bsddb.*open functions can now take 'None' as a filename.
  This will create a temporary in-memory bsddb that won't be
  written to disk.

- posix.getloadavg, posix.lchown, posix.killpg, posix.mknod, and
  posix.getpgid have been added where available.

- The locale module now exposes the C library's gettext interface. It
  also has a new function getpreferredencoding.

- A security hole ("double free") was found in zlib-1.1.3, a popular
  third party compression library used by some Python modules.  The
  hole was quickly plugged in zlib-1.1.4, and the Windows build of
  Python now ships with zlib-1.1.4.

- pwd, grp, and resource return enhanced tuples now, with symbolic
  field names.

- array.array is now a type object. A new format character
  'u' indicates Py_UNICODE arrays. For those, .tounicode and
  .fromunicode methods are available. Arrays now support __iadd__
  and __imul__.

- dl now builds on every system that has dlfcn.h.  Failure in case
  of sizeof(int)!=sizeof(long)!=sizeof(void*) is delayed until dl.open
  is called.

- The sys module acquired a new attribute, api_version, which evaluates
  to the value of the PYTHON_API_VERSION macro with which the
  interpreter was compiled.

- Fixed bug #470582: sre module would return a tuple (None, 'a', 'ab')
  when applying the regular expression '^((a)c)?(ab)$' on 'ab'. It now
  returns (None, None, 'ab'), as expected. Also fixed handling of
  lastindex/lastgroup match attributes in similar cases. For example,
  when running the expression r'(a)(b)?b' over 'ab', lastindex must be
  1, not 2.
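
The two cases from this entry can be replayed through the ``re`` module, which exposes the sre engine. This is a sketch for a current interpreter; the variable names are illustrative:

```python
import re

# The optional outer group fails on "c", so neither group 1 nor the
# nested group 2 may keep the "a" that was matched tentatively.
optional_groups = re.match(r'^((a)c)?(ab)$', 'ab').groups()

# Group 2 first consumes "b", then has to give it back so that the
# literal "b" can match; only group 1 ends up participating.
last_matched = re.match(r'(a)(b)?b', 'ab').lastindex
```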

- Fixed bug #581080: sre scanner was not checking the buffer limit
  before increasing the current pointer. This was creating an infinite
  loop in the search function, once the pointer exceeded the buffer
  limit.

- The os.fdopen function now enforces a file mode starting with the
  letter 'r', 'w' or 'a', otherwise a ValueError is raised. This fixes
  bug #623464.

- The linuxaudiodev module is now deprecated; it is being replaced by
  ossaudiodev.  The interface has been extended to cover a lot more of
  OSS (see www.opensound.com), including most DSP ioctls and the
  OSS mixer API.  Documentation forthcoming in 2.3a2.

Library
-------

- imaplib.py now supports SSL (Tino Lange and Piers Lauder).

- Freeze's modulefinder.py has been moved to the standard library;
  slightly improved so it will issue fewer false missing submodule
  reports (see SF patch #643711 for details).  Documentation will follow
  with Python 2.3a2.

- os.path exposes getctime.

- unittest.py now has two additional methods called assertAlmostEqual()
  and failIfAlmostEqual().  They implement an approximate comparison
  by rounding the difference between the two arguments and comparing
  the result to zero.  Approximate comparison is essential for
  unit tests of floating point results.
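
A minimal sketch of why the approximate comparison matters; the test case name ``FloatSumTest`` is hypothetical, and the suite is run in-process so that nothing is printed:

```python
import unittest


class FloatSumTest(unittest.TestCase):
    def test_sum_of_tenths(self):
        total = sum([0.1] * 10)
        # Binary floating point cannot represent 0.1 exactly, so the
        # accumulated sum differs from 1.0 by a tiny rounding error.
        self.assertNotEqual(total, 1.0)
        # assertAlmostEqual rounds the difference (7 places by default)
        # and compares the result to zero.
        self.assertAlmostEqual(total, 1.0)


suite = unittest.defaultTestLoader.loadTestsFromTestCase(FloatSumTest)
result = unittest.TestResult()
suite.run(result)
```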

- calendar.py now depends on the new datetime module rather than
  the time module.  As a result, the range of allowable dates
  has been increased.

- pdb has a new 'j(ump)' command to select the next line to be
  executed.

- Windows installers created by distutils can now run a
  postinstallation script.

- doctest.testmod can now be called without argument, which means to
  test the current module.

- When canceling a server that implemented threading with a keyboard
  interrupt, the server would shut down but not terminate (waiting on
  client threads). A new member variable, daemon_threads, was added to
  the ThreadingMixIn class in SocketServer.py to make it explicit that
  this behavior needs to be controlled.

- A new module, optparse, provides a fancy alternative to getopt for
  command line parsing.  It is a slightly modified version of Greg
  Ward's Optik package.
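
A short sketch of the declarative style optparse offers. The option names and the argument list are invented for illustration, and an explicit list is parsed instead of sys.argv so the example is self-contained:

```python
from optparse import OptionParser

parser = OptionParser(usage="%prog [options] FILE")
parser.add_option("-q", "--quiet", action="store_true", dest="quiet",
                  default=False, help="suppress progress messages")
parser.add_option("-n", "--count", type="int", dest="count", default=1,
                  help="number of passes to run")

# parse_args() returns the collected option values and the leftover
# positional arguments.
options, args = parser.parse_args(["-q", "--count", "3", "data.txt"])
```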

- UserDict.py now defines a DictMixin class which defines all dictionary
  methods for classes that already have a minimum mapping interface.
  This greatly simplifies writing classes that need to be substitutable
  for dictionaries (such as the shelve module).

- shelve.py now subclasses from UserDict.DictMixin.  Now shelve supports
  all dictionary methods.  This eases the transition to persistent
  storage for scripts originally written with dictionaries in mind.

- shelve.open and the various classes in shelve.py now accept an optional
  binary flag, which defaults to False.  If True, the values stored in the
  shelf are binary pickles.

- A new package, logging, implements the logging API defined by PEP
  282.  The code is written by Vinay Sajip.
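
A minimal sketch of the package in use on a current interpreter. The logger name ``demo.news`` and the format string are assumptions, and output is captured in memory so the example has no side effects:

```python
import io
import logging

stream = io.StringIO()
handler = logging.StreamHandler(stream)
handler.setFormatter(logging.Formatter("%(levelname)s:%(name)s:%(message)s"))

log = logging.getLogger("demo.news")
log.setLevel(logging.INFO)
log.addHandler(handler)
log.propagate = False  # keep records away from any root handlers

log.debug("hidden")          # below the INFO threshold, so dropped
log.info("visible %d", 42)   # arguments are merged into the message lazily

captured = stream.getvalue()
log.removeHandler(handler)
```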

- StreamReader, StreamReaderWriter and StreamRecoder in the codecs
  module are now iterators.

- gzip.py now handles files exceeding 2GB.  Files over 4GB also work
  now (provided the OS supports it, and Python is configured with large
  file support), but in that case the underlying gzip file format can
  record only the least-significant 32 bits of the file size, so that
  some tools working with gzipped files may report an incorrect file
  size.

- xml.sax.saxutils.unescape has been added, to replace entity references
  with their entity value.
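
For illustration, unescape() reverses what escape() does; the sample strings and the extra ``&quot;`` mapping are arbitrary:

```python
from xml.sax.saxutils import escape, unescape

raw = "a < b && c > d"
escaped = escape(raw)          # "&", "<" and ">" become entity references
restored = unescape(escaped)   # and are turned back into characters

# Further entities can be supplied through the optional mapping argument.
quoted = unescape("&quot;hi&quot;", {"&quot;": '"'})
```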

- Queue.Queue.{put,get} now support an optional timeout argument.

- Various features of Tk 8.4 are exposed in Tkinter.py. The multiple
  option of tkFileDialog is exposed as function askopenfile{,name}s.

- Various configure methods of Tkinter have been streamlined, so that
  tag_configure, image_configure, window_configure now return a
  dictionary when invoked with no argument.

- Importing the readline module now no longer has the side effect of
  calling setlocale(LC_CTYPE, "").  The initial "C" locale, or
  whatever locale is explicitly set by the user, is preserved.  If you
  want repr() of 8-bit strings in your preferred encoding to preserve
  all printable characters of that encoding, you have to add the
  following code to your $PYTHONSTARTUP file or to your application's
  main():

    import locale
    locale.setlocale(locale.LC_CTYPE, "")

- shutil.move was added. shutil.copytree now reports errors as an
  exception at the end, instead of printing error messages.

- Encoding name normalization was generalized to not only
  replace hyphens with underscores, but also all other non-alphanumeric
  characters (with the exception of the dot which is used for Python
  package names during lookup). The aliases.py mapping was updated
  to the new standard.

- mimetypes has two new functions: guess_all_extensions() which
  returns a list of all known extensions for a mime type, and
  add_type() which adds one mapping between a mime type and
  an extension to the database.

- New module: sets, defines the class Set that implements a mutable
  set type using the keys of a dict to represent the set.  There's
  also a class ImmutableSet which is useful when you need sets of sets
  or when you need to use sets as dict keys, and a class BaseSet which
  is the base class of the two.

- Added random.sample(population,k) for random sampling without replacement.
  Returns a k length list of unique elements chosen from the population.
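
A brief sketch of sampling without replacement. A private ``random.Random`` instance with an arbitrary seed is used so that the module-level generator is left untouched:

```python
import random

rng = random.Random(12345)     # arbitrary seed, for repeatability only
population = list(range(100))

# Five distinct positions are drawn; no element is picked twice, and the
# population itself is not modified.
picked = rng.sample(population, 5)
```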

- random.randrange(-sys.maxint-1, sys.maxint) no longer raises
  OverflowError.  That is, it now accepts any combination of 'start'
  and 'stop' arguments so long as each is in the range of Python's
  bounded integers.

- Thanks to Raymond Hettinger, random.random() now uses a new core
  generator.  The Mersenne Twister algorithm is implemented in C,
  threadsafe, faster than the previous generator, has an astronomically
  large period (2**19937-1), creates random floats to full 53-bit
  precision, and may be the most widely tested random number generator
  in existence.

  The random.jumpahead(n) method has different semantics for the new
  generator.  Instead of jumping n steps ahead, it uses n and the
  existing state to create a new state.  This means that jumpahead()
  continues to support multi-threaded code needing generators of
  non-overlapping sequences.  However, it will break code which relies
  on jumpahead moving a specific number of steps forward.

  The attributes random.whseed and random.__whseed have no meaning for
  the new generator.  Code using these attributes should switch to a
  new class, random.WichmannHill which is provided for backward
  compatibility and to make an alternate generator available.

- New "algorithms" module: heapq, implements a heap queue.  Thanks to
  Kevin O'Connor for the code and François Pinard for an entertaining
  write-up explaining the theory and practical uses of heaps.
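
One common use of a heap queue is a priority queue. In this sketch the task tuples are invented, and a plain list serves as the heap:

```python
import heapq

tasks = []  # an ordinary list, kept in heap order by the heapq functions
for priority, name in [(3, "write docs"), (1, "fix bug"), (2, "add test")]:
    heapq.heappush(tasks, (priority, name))

# heappop() always removes the smallest remaining item.
ordered = []
while tasks:
    ordered.append(heapq.heappop(tasks))
```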

- New encoding for the Palm OS character set: palmos.

- binascii.crc32() and the zipfile module had problems on some 64-bit
  platforms.  These have been fixed.  On a platform with 8-byte C longs,
  crc32() now returns a sign-extended 4-byte result, so that its value
  as a Python int is equal to the value computed on a 32-bit platform.

- xml.dom.minidom.toxml and toprettyxml now take an optional encoding
  argument.

- Some fixes in the copy module: when an object is copied through its
  __reduce__ method, there was no check for a __setstate__ method on
  the result [SF patch 565085]; deepcopy should treat instances of
  custom metaclasses the same way it treats instances of type 'type'
  [SF patch 560794].

- Sockets now support timeout mode.  After s.settimeout(T), where T is
  a float expressing seconds, subsequent operations raise an exception
  if they cannot be completed within T seconds.  To disable timeout
  mode, use s.settimeout(None).  There's also a module function,
  socket.setdefaulttimeout(T), which sets the default for all sockets
  created henceforth.
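
The calls can be tried without any network traffic. This sketch only toggles the settings on unconnected sockets and restores the global default afterwards; the timeout values are arbitrary:

```python
import socket

sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
try:
    sock.settimeout(2.5)                  # operations give up after 2.5 s
    timeout_enabled = sock.gettimeout()
    sock.settimeout(None)                 # back to plain blocking mode
    timeout_disabled = sock.gettimeout()
finally:
    sock.close()

previous_default = socket.getdefaulttimeout()
socket.setdefaulttimeout(1.0)
try:
    fresh = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    inherited_timeout = fresh.gettimeout()  # picked up from the default
    fresh.close()
finally:
    socket.setdefaulttimeout(previous_default)  # restore the global setting
```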

- getopt.gnu_getopt was added.  This supports GNU-style option
  processing, where options can be mixed with non-option arguments.
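
To contrast the two styles, the sketch below parses the same invented argument list with both functions. It assumes the POSIXLY_CORRECT environment variable is not set, since that makes gnu_getopt stop at the first non-option argument as well:

```python
import getopt

argv = ["input.txt", "-v", "--output", "out.txt", "extra"]

# Traditional parsing stops at the first non-option argument, so nothing
# is recognized here.
classic_options, classic_rest = getopt.getopt(argv, "v", ["output="])

# GNU-style parsing keeps scanning past non-option arguments.
gnu_options, gnu_rest = getopt.gnu_getopt(argv, "v", ["output="])
```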

- Stop using strings for exceptions.  String objects used for
  exceptions are now classes deriving from Exception.  The objects
  changed were: Tkinter.TclError, bdb.BdbQuit, macpath.norm_error,
  tabnanny.NannyNag, and xdrlib.Error.

- Constants BOM_UTF8, BOM_UTF16, BOM_UTF16_LE, BOM_UTF16_BE,
  BOM_UTF32, BOM_UTF32_LE and BOM_UTF32_BE that represent the Byte
  Order Mark in UTF-8, UTF-16 and UTF-32 encodings for little and
  big endian systems were added to the codecs module. The old names
  BOM32_* and BOM64_* were off by a factor of 2.
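
One way the constants might be used is to sniff the encoding of raw bytes. The helper ``sniff_bom`` and the encoding names it returns are illustrative choices, not part of the codecs module:

```python
import codecs

# UTF-32 marks are checked before UTF-16 ones: BOM_UTF32_LE begins with
# the same two bytes as BOM_UTF16_LE, so the shorter mark would otherwise
# match UTF-32 data by mistake.
_BOM_TABLE = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def sniff_bom(data):
    """Return (encoding_name, bom_length), or (None, 0) if no mark is found."""
    for mark, name in _BOM_TABLE:
        if data.startswith(mark):
            return name, len(mark)
    return None, 0
```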

- Added conversion functions math.degrees() and math.radians().

- math.log() now takes an optional argument:  math.log(x[, base]).
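
A quick illustration of the optional base, together with math.degrees(); results are floating point, so they are best compared with a tolerance rather than for exact equality:

```python
import math

bits_needed = math.log(1024, 2)   # logarithm to base 2, close to 10.0
decades = math.log(1000, 10)      # logarithm to base 10, close to 3.0
natural = math.log(math.e)        # base omitted: natural logarithm

half_turn = math.degrees(math.pi)  # radians to degrees, close to 180.0
```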

- ftplib.retrlines() now tests for callback is None rather than testing
  for False.  Was causing an error when given a callback object which
  was callable but also returned len() as zero.  The change may
  create new breakage if the caller relied on the undocumented behavior
  and called with callback set to [] or some other False value not
  identical to None.

- random.gauss() uses a piece of hidden state used by nothing else,
  and the .seed() and .whseed() methods failed to reset it.  In other
  words, setting the seed didn't completely determine the sequence of
  results produced by random.gauss().  It does now.  Programs repeatedly
  mixing calls to a seed method with calls to gauss() may see different
  results now.

- The pickle.Pickler class grew a clear_memo() method to mimic that
  provided by cPickle.Pickler.

- difflib's SequenceMatcher class now does a dynamic analysis of
  which elements are so frequent as to constitute noise.  For
  comparing files as sequences of lines, this generally works better
  than the IS_LINE_JUNK function, and function ndiff's linejunk
  argument defaults to None now as a result.  A happy benefit is
  that SequenceMatcher may run much faster now when applied
  to large files with many duplicate lines (for example, C program
  text with lots of repeated "}" and "return NULL;" lines).

- New Text.dump() method in Tkinter module.

- New distutils commands for building packagers were added to
  support pkgtool on Solaris and swinstall on HP-UX.

- distutils now has a new abstract binary packager base class
  command/bdist_packager, which simplifies writing packagers.
  This will hopefully provide the missing bits to encourage
  people to submit more packagers, e.g. for Debian, FreeBSD
  and other systems.

- The UTF-16, -LE and -BE stream readers now raise a
  NotImplementedError for all calls to .readline(). Previously, they
  used to just produce garbage or fail with an encoding error --
  UTF-16 is a 2-byte encoding and the C lib's line reading APIs don't
  work well with these.

- compileall now supports quiet operation.

- The BaseHTTPServer now implements optional HTTP/1.1 persistent
  connections.

- socket module: the SSL support was broken out of the main
  _socket module C helper and placed into a new _ssl helper
  which now gets imported by socket.py if available and working.

- encodings package: added aliases for all supported IANA character
  sets.

- ftplib: to safeguard the user's privacy, anonymous login will use
  "anonymous@" as default password, rather than the real user and host
  name.

- webbrowser: tightened up the command passed to os.system() so that
  arbitrary shell code can't be executed because a bogus URL was
  passed in.

- gettext.translation has an optional fallback argument, and
  gettext.find an optional all argument. Translations will now fall back
  on a per-message basis. The module supports plural forms, by means
  of gettext.[d]ngettext and Translation.[u]ngettext.

- distutils bdist commands now offer a --skip-build option.

- warnings.warn now accepts a Warning instance as first argument.
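
A small sketch of passing an instance rather than a message string. The ``ConfigWarning`` category is hypothetical, and ``warnings.catch_warnings``, a later addition to the module, is used only to capture the result:

```python
import warnings


class ConfigWarning(UserWarning):
    """Hypothetical warning category used only for this illustration."""


with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    # The category is taken from the instance's class, so no separate
    # category argument is needed.
    warnings.warn(ConfigWarning("option 'x' is ignored"))

recorded_category = caught[0].category
recorded_text = str(caught[0].message)
```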

- The xml.sax.expatreader.ExpatParser class will no longer create
  circular references by using itself as the locator that gets passed
  to the content handler implementation.  [SF bug #535474]

- The email.Parser.Parser class now properly parses strings regardless
  of their line endings, which can be any of \r, \n, or \r\n (CR, LF,
  or CRLF).  Also, the Header class's constructor default arguments
  have changed slightly so that an explicit maxlinelen value is always
  honored, and so unicode conversion error handling can be specified.

- distutils' build_ext command now links C++ extensions with the C++
  compiler available in the Makefile or CXX environment variable, if
  running under \*nix.

- New module bz2: provides a comprehensive interface for the bz2 compression
  library.  It implements a complete file interface, one-shot (de)compression
  functions, and types for sequential (de)compression.
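
The one-shot functions and the sequential compressor type can be sketched as follows, using bytes literals as on a current interpreter; the payload and the split point are arbitrary:

```python
import bz2

original = b"spam " * 1000

# One-shot compression and decompression.
packed = bz2.compress(original)
roundtrip = bz2.decompress(packed)

# Sequential compression: feed the data in pieces, then flush.
compressor = bz2.BZ2Compressor()
pieces = [
    compressor.compress(original[:2500]),
    compressor.compress(original[2500:]),
    compressor.flush(),
]
streamed = bz2.decompress(b"".join(pieces))
```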

- New pdb command 'pp' which is like 'p' except that it pretty-prints
  the value of its expression argument.

- The bdist_rpm distutils command now understands a verify_script option in
  the config file, including the contents of the referred filename in
  the "%verifyscript" section of the rpm spec file.

- Fixed bug #495695: webbrowser module would run graphic browsers in a
  unix environment even if DISPLAY was not set. Also, support for
  skipstone browser was included.

- Fixed bug #636769: rexec would run disallowed code if subclasses of
  strings were used as parameters for certain functions.

Tools/Demos
-----------

- pygettext.py now supports globbing on Windows, and accepts module
  names in addition to accepting file names.

- The SGI demos (Demo/sgi) have been removed.  Nobody thought they
  were interesting any more.  (The SGI library modules and extensions
  are still there; it is believed that at least some of these are
  still used and useful.)

- IDLE supports the new encoding declarations (PEP 263); it can also
  deal with legacy 8-bit files if they use the locale's encoding. It
  allows non-ASCII strings in the interactive shell and executes them
  in the locale's encoding.

- freeze.py now produces binaries which can import shared modules,
  unlike before when this failed due to missing symbol exports in
  the generated binary.

Build
-----

- On Unix, IDLE is now installed automatically.

- The fpectl module is not built by default; it's dangerous or useless
  except in the hands of experts.

- The public Python C API will generally be declared using PyAPI_FUNC
  and PyAPI_DATA macros, while Python extension module init functions
  will be declared with PyMODINIT_FUNC.  DL_EXPORT/DL_IMPORT macros
  are deprecated.

- A bug was fixed that could cause COUNT_ALLOCS builds to segfault, or
  get into infinite loops, when a new-style class got garbage-collected.
  Unfortunately, to avoid this, the way COUNT_ALLOCS works requires
  that new-style classes be immortal in COUNT_ALLOCS builds.  Note that
  COUNT_ALLOCS is not enabled by default, in either release or debug
  builds, and that new-style classes are immortal only in COUNT_ALLOCS
  builds.

- Compiling out the cyclic garbage collector is no longer an option.
  The old symbol WITH_CYCLE_GC is now ignored, and Python.h arranges
  that it's always defined (for the benefit of any extension modules
  that may be conditionalizing on it).  A bonus is that any extension
  type participating in cyclic gc can choose to participate in the
  Py_TRASHCAN mechanism now too; in the absence of cyclic gc, this used
  to require editing the core to teach the trashcan mechanism about the
  new type.

- According to Annex F of the current C standard,

    The Standard C macro HUGE_VAL and its float and long double analogs,
    HUGE_VALF and HUGE_VALL, expand to expressions whose values are
    positive infinities.

  Python only uses the double HUGE_VAL, and only to #define its own symbol
  Py_HUGE_VAL.  Some platforms have incorrect definitions for HUGE_VAL.
  pyport.h used to try to worm around that, but the workarounds triggered
  other bugs on other platforms, so we gave up.  If your platform defines
  HUGE_VAL incorrectly, you'll need to #define Py_HUGE_VAL to something
  that works on your platform.  The only instance of this I'm sure about
  is on an unknown subset of Cray systems, described here:

  http://www.cray.com/swpubs/manuals/SN-2194_2.0/html-SN-2194_2.0/x3138.htm

  Presumably 2.3a1 breaks such systems.  If anyone uses such a system, help!

- The configure option --without-doc-strings can be used to remove the
  doc strings from the builtin functions and modules; this reduces the
  size of the executable.

- The universal newlines option (PEP 278) is on by default.  On Unix
  it can be disabled by passing --without-universal-newlines to the
  configure script.  On other platforms, remove
  WITH_UNIVERSAL_NEWLINES from pyconfig.h.

- On Unix, a shared libpython2.3.so can be created with --enable-shared.

- All uses of the CACHE_HASH, INTERN_STRINGS, and DONT_SHARE_SHORT_STRINGS
  preprocessor symbols were eliminated.  The internal decisions they
  controlled stopped being experimental long ago.

- The tools used to build the documentation now work under Cygwin as
  well as Unix.

- The bsddb and dbm module builds have been changed to try and avoid version
  skew problems and disable linkage with Berkeley DB 1.85 unless the
  installer knows what s/he's doing.  See the section on building these
  modules in the README file for details.
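
As an illustration of what the universal newlines entry above provides at the Python level, the following sketch reads back a file that mixes all three line-ending conventions.  It assumes a current Python 3 interpreter, where text-mode ``open()`` applies universal newline translation by default (in 2.3 the feature was requested with the "U" mode flag); ``read_normalized`` is a hypothetical helper name, not part of any module.

```python
import os
import tempfile


def read_normalized(raw):
    """Write raw bytes to a temporary file and read them back as text."""
    fd, path = tempfile.mkstemp()
    try:
        with os.fdopen(fd, "wb") as f:
            f.write(raw)
        # newline=None (the default) turns "\r" and "\r\n" into "\n" on input.
        with open(path, "r", newline=None, encoding="ascii") as f:
            return f.read()
    finally:
        os.remove(path)


# Unix, classic Mac and DOS line endings in one file.
text = read_normalized(b"unix\nmac\rdos\r\n")
```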

C API
-----

- PyNumber_Check() now returns true for string and unicode objects.
  This is a result of these types having a partially defined
  tp_as_number slot.  (This is not a feature, but an indication that
  PyNumber_Check() is not very useful to determine numeric behavior.
  It may be deprecated.)

- The string object's layout has changed: the pointer member
  ob_sinterned has been replaced by an int member ob_sstate.  On some
  platforms (e.g. most 64-bit systems) this may change the offset of
  the ob_sval member, so as a precaution the API_VERSION has been
  incremented.  The apparently unused feature of "indirect interned
  strings", supported by the ob_sinterned member, is gone.  Interned
  strings are now usually mortal; there is a new API,
  PyString_InternImmortal() that creates immortal interned strings.
  (The ob_sstate member can only take three values; however, while
  making it a char saves a few bytes per string object on average,
  it also slowed things down a bit because ob_sval was no longer
  aligned.)

- The Py_InitModule*() functions now accept NULL for the 'methods'
  argument.  Modules without global functions are becoming more common
  now that factories can be types rather than functions.

- New C API PyUnicode_FromOrdinal() which exposes unichr() at C
  level.

- New functions PyErr_SetExcFromWindowsErr() and
  PyErr_SetExcFromWindowsErrWithFilename(). Similar to
  PyErr_SetFromWindowsErrWithFilename() and
  PyErr_SetFromWindowsErr(), but they allow specifying
  the exception type to raise. Available on Windows.

- Py_FatalError() is now declared as taking a const char* argument.  It
  was previously declared without const.  This should not affect working
  code.

- Added new macro PySequence_ITEM(o, i) that directly calls
  sq_item without rechecking that o is a sequence and without
  adjusting for negative indices.

- PyRange_New() now raises ValueError if the fourth argument is not 1.
  This is part of the removal of deprecated features of the xrange
  object.

- PyNumber_Coerce() and PyNumber_CoerceEx() now also invoke the type's
  coercion if both arguments have the same type but this type has the
  CHECKTYPES flag set.  This is to better support proxies.

- The type of tp_free has been changed from "``void (*)(PyObject *)``" to
  "``void (*)(void *)``".

- PyObject_Del, PyObject_GC_Del are now functions instead of macros.

- A type can now inherit its metatype from its base type.  Previously,
  when PyType_Ready() was called, if ob_type was found to be NULL, it
  was always set to &PyType_Type; now it is set to base->ob_type,
  where base is tp_base, defaulting to &PyObject_Type.

- PyType_Ready() accidentally did not inherit tp_is_gc; now it does.

- The PyCore_* family of APIs have been removed.

- The "u#" parser marker will now pass through Unicode objects as-is
  without going through the buffer API.

- The enumerators of cmp_op have been renamed to use the prefix ``PyCmp_``.

- An old #define of ANY as void has been removed from pyport.h.  This
  hasn't been used since Python's pre-ANSI days, and the #define has
  been marked as obsolete since then.  SF bug 495548 says it created
  conflicts with other packages, so keeping it around wasn't harmless.

- Because Python's magic number scheme broke on January 1st, we decided
  to stop Python development.  Thanks for all the fish!

- Some of us don't like fish, so we changed Python's magic number
  scheme to a new one. See Python/import.c for details.

New platforms
-------------

- OpenVMS is now supported.

- AtheOS is now supported.

- The EMX runtime environment on OS/2 is now supported.

- GNU/Hurd is now supported.

Tests
-----

- The regrtest.py script's -u option now provides a way to say "allow
  all resources except this one."  For example, to allow everything
  except bsddb, give the option '-uall,-bsddb'.

Windows
-------

- The Windows distribution now ships with version 4.0.14 of the
  Sleepycat Berkeley database library.  This should be a huge
  improvement over the previous Berkeley DB 1.85, which had many
  bugs.
  XXX What are the licensing issues here?
  XXX If a user has a database created with a previous version of
  XXX     Python, what must they do to convert it?
  XXX I'm still not sure how to link this thing (see PCbuild/readme.txt).
  XXX The version # is likely to change before 2.3a1.

- The Windows distribution now ships with a Secure Sockets Layer (SSL)
  module (_ssl.pyd).

- The Windows distribution now ships with Tcl/Tk version 8.4.1 (it
  previously shipped with Tcl/Tk 8.3.2).

- When Python is built under a Microsoft compiler, sys.version now
  includes the compiler version number (_MSC_VER).  For example, under
  MSVC 6, sys.version contains the substring "MSC v.1200 ".  1200 is
  the value of _MSC_VER under MSVC 6.

- Sometimes the uninstall executable (UNWISE.EXE) vanishes.  One cause
  of that has been fixed in the installer (disabled Wise's "delete in-
  use files" uninstall option).

- Fixed a bug in urllib's proxy handling in Windows.  [SF bug #503031]

- The installer now installs Start menu shortcuts under (the local
  equivalent of) "All Users" when doing an Admin install.

- file.truncate([newsize]) now works on Windows for all newsize values.
  It used to fail if newsize didn't fit in 32 bits, reflecting a
  limitation of MS _chsize (which is no longer used).

- os.waitpid() is now implemented for Windows, and can be used to block
  until a specified process exits.  This is similar to, but not exactly
  the same as, os.waitpid() on POSIX systems.  If you're waiting for
  a specific process whose pid was obtained from one of the spawn()
  functions, the same Python os.waitpid() code works across platforms.
  See the docs for details.  The docs were changed to clarify that
  spawn functions return, and waitpid requires, a process handle on
  Windows (not the same thing as a Windows process id).

- New tempfile.TemporaryFile implementation for Windows:  this doesn't
  need a TemporaryFileWrapper wrapper anymore, and should be immune
  to a nasty problem:  before 2.3, if you got a temp file on Windows, it
  got wrapped in an object whose close() method first closed the
  underlying file, then deleted the file.  This usually worked fine.
  However, the spawn family of functions on Windows create (at a low C
  level) the same set of open files in the spawned process Q as were
  open in the spawning process P.  If a temp file f was among them, then
  doing f.close() in P first closed P's C-level file handle on f, but Q's
  C-level file handle on f remained open, so the attempt in P to delete f
  blew up with a "Permission denied" error (Windows doesn't allow
  deleting open files).  This was surprising, subtle, and difficult to
  work around.

- The os module now exports all the symbolic constants usable with the
  low-level os.open() on Windows:  the new constants in 2.3 are
  O_NOINHERIT, O_SHORT_LIVED, O_TEMPORARY, O_RANDOM and O_SEQUENTIAL.
  The others were also available in 2.2:  O_APPEND, O_BINARY, O_CREAT,
  O_EXCL, O_RDONLY, O_RDWR, O_TEXT, O_TRUNC and O_WRONLY.  Contrary
  to Microsoft docs, O_SHORT_LIVED does not seem to imply O_TEMPORARY
  (so specify both if you want both; note that neither is useful unless
  specified with O_CREAT too).
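
The os.waitpid() entry above says that code waiting on a process started by one of the spawn() functions works across platforms.  A minimal sketch of that pattern follows, written for a current Python 3 interpreter; ``run_and_wait`` is a hypothetical helper name.  The value returned by ``os.spawnv()`` is a process id on POSIX and a process handle on Windows, and in both cases it is passed straight to ``os.waitpid()``.

```python
import os
import sys


def run_and_wait(args):
    """Spawn args without blocking, then block until the child exits."""
    # P_NOWAIT returns immediately with a pid (POSIX) or handle (Windows).
    child = os.spawnv(os.P_NOWAIT, args[0], args)
    # waitpid() blocks until that specific child has exited.
    _, status = os.waitpid(child, 0)
    return status


# Run a child interpreter that exits immediately with code 0.
status = run_and_wait([sys.executable, "-c", "pass"])
```

The encoding of a non-zero ``status`` differs between platforms, so portable code should only compare it against zero or consult the os module documentation.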

Mac
----

- Mac/Relnotes is gone; the release notes are now here.

- Python (the OSX-only, unix-based version, not the OS9-compatible CFM
  version) now fully supports unicode strings as arguments to various file
  system calls, e.g. open(), file(), os.stat() and os.listdir().

- The current naming convention for Python on the Macintosh is that MacPython
  refers to the unix-based OSX-only version, and MacPython-OS9 refers to the
  CFM-based version that runs on both OS9 and OSX.

- All MacPython-OS9 functionality is now available in an OSX unix build,
  including the Carbon modules, the IDE, OSA support, etc. A lot of this
  will only work correctly in a framework build, though, because you cannot
  talk to the window manager unless your application is run from a .app
  bundle. There is a command line tool "pythonw" that runs your script
  with an interpreter living in such a .app bundle; this interpreter should
  be used to run any Python script using the window manager (including
  Tkinter or wxPython scripts).

- Most of Mac/Lib has moved to Lib/plat-mac, which is again used both in
  MacPython-OSX and MacPython-OS9. The only modules remaining in Mac/Lib
  are specifically for MacPython-OS9 (CFM support, preference resources, etc).

- A new utility PythonLauncher will start a Python interpreter when a .py or
  .pyw script is double-clicked in the Finder. By default .py scripts are
  run with a normal Python interpreter in a Terminal window and .pyw
  files are run with a window-aware pythonw interpreter without a Terminal
  window, but all this can be customized.

- MacPython-OS9 is now Carbon-only, so it runs on Mac OS 9 or Mac OS X and
  possibly on Mac OS 8.6 with the right CarbonLib installed, but not on earlier
  releases.

- Many tools such as BuildApplet.py and gensuitemodule.py now support a command
  line interface too.

- All the Carbon classes are now PEP253 compliant, meaning that you can
  subclass them from Python. Most of the attributes are gone; you should
  now use the accessor function call API, which is also what Apple's
  documentation uses. Some attributes such as grafport.visRgn are still
  available for convenience.

- New Carbon modules File (implementing the APIs in Files.h and Aliases.h)
  and Folder (APIs from Folders.h). The old macfs builtin module is
  gone, and replaced by a Python wrapper around the new modules.

- Pathname handling should now be fully consistent: MacPython-OSX always uses
  unix pathnames and MacPython-OS9 always uses colon-separated Mac pathnames
  (also when running on Mac OS X).

- New Carbon modules Help and AH give access to the Carbon Help Manager.
  There are hooks in the IDE to allow accessing the Python documentation
  (and Apple's Carbon and Cocoa documentation) through the Help Viewer.
  See Mac/OSX/README for converting the Python documentation to a
  Help Viewer compatible form and installing it.

- OSA support has been redesigned and the generated Python classes now
  mirror the inheritance defined by the underlying OSA classes.

- MacPython no longer maps both \r and \n to \n on input for any text file.
  This feature has been replaced by universal newline support (PEP278).

- The default encoding for Python sourcefiles in MacPython-OS9 is no longer
  mac-roman (or whatever your local Mac encoding was) but "ascii", like on
  other platforms. If you really need sourcefiles with Mac characters in them
  you can change this in site.py.


What's New in Python 2.2 final?
===============================

*Release date: 21-Dec-2001*

Type/class unification and new-style classes
--------------------------------------------

- pickle.py, cPickle: allow pickling instances of new-style classes
  with a custom metaclass.

Core and builtins
-----------------

- weakref proxy object: when comparing, unwrap both arguments if both
  are proxies.

Extension modules
-----------------

- binascii.b2a_base64(): fix a potential buffer overrun when encoding
  very short strings.

- cPickle: the obscure "fast" mode was suspected of causing stack
  overflows on the Mac.  Hopefully fixed this by setting the recursion
  limit much smaller.  If the limit is too low (it only affects
  performance), you can change it by defining PY_CPICKLE_FAST_LIMIT
  when compiling cPickle.c (or in pyconfig.h).
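
The binascii.b2a_base64() fix above concerns very short inputs.  A quick round-trip check on such inputs might look like the sketch below, which assumes a current Python 3 interpreter (where the function takes and returns bytes); ``short_inputs`` is just an illustrative list.

```python
import binascii

# Inputs of zero to three bytes cover every base64 padding case.
short_inputs = [b"", b"a", b"ab", b"abc"]

# b2a_base64() appends a newline to each encoded line by default.
encoded = [binascii.b2a_base64(raw) for raw in short_inputs]

# Decoding must reproduce the original bytes exactly.
decoded = [binascii.a2b_base64(line) for line in encoded]
```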

Library
-------

- dumbdbm.py: fixed a dumb old bug (the file didn't get synched at
  close or delete time).

- rfc822.py: fixed a bug where the address '<>' was converted to None
  instead of an empty string (also fixes the email.Utils module).

- xmlrpclib.py: version 1.0.0; uses precision for doubles.

- test suite: the pickle and cPickle tests were not executing any code
  when run from the standard regression test.

Tools/Demos
-----------

Build
-----

C API
-----

New platforms
-------------

Tests
-----

Windows
-------

- distutils package: fixed broken Windows installers (bdist_wininst).

- tempfile.py: prevent mysterious warnings when TemporaryFileWrapper
  instances are deleted at process exit time.

- socket.py: prevent mysterious warnings when socket instances are
  deleted at process exit time.

- posixmodule.c: fix a Windows crash with stat() of a filename ending
  in backslash.

Mac
----

- The Carbon toolbox modules have been upgraded to Universal Headers
  3.4, and experimental CoreGraphics and CarbonEvents modules have
  been added.  All only for framework-enabled MacOSX.


What's New in Python 2.2c1?
===========================

*Release date: 14-Dec-2001*

Type/class unification and new-style classes
--------------------------------------------

- Guido's tutorial introduction to the new type/class features has
  been extensively updated.  See

      http://www.python.org/2.2/descrintro.html

  That remains the primary documentation in this area.

- Fixed a leak: instance variables declared with __slots__ were never
  deleted!

- The "delete attribute" method of descriptor objects is called
  __delete__, not __del__.  In previous releases, it was mistakenly
  called __del__, which created an unfortunate overloading condition
  with finalizers.  (The "get attribute" and "set attribute" methods
  are still called __get__ and __set__, respectively.)

- Some subtle issues with the super built-in were fixed:

  (a) When super itself is subclassed, its __get__ method would still
      return an instance of the base class (i.e., of super).

  (b) super(C, C()).__class__ would return C rather than super.  This
      is confusing.  To fix this, I decided to change the semantics of
      super so that it only applies to code attributes, not to data
      attributes.  After all, overriding data attributes is not
      supported anyway.

  (c) The __get__ method didn't check whether the argument was an
      instance of the type used in creation of the super instance.

- Previously, hash() of an instance of a subclass of a mutable type
  (list or dictionary) would return some value, rather than raising
  TypeError.  This has been fixed.  Also, directly calling
  dict.__hash__ and list.__hash__ now raises the same TypeError
  (previously, these were the same as object.__hash__).

- New-style objects now support deleting their __dict__.  This is for
  all intents and purposes equivalent to assigning a brand new empty
  dictionary, but saves space if the object is not used further.
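
Three of the entries above (the ``__delete__`` descriptor method, hash() on subclasses of mutable types, and deleting ``__dict__``) can be sketched together.  This is an illustration only, written for a current CPython 3 interpreter where these behaviours still hold; ``Logged``, ``Holder``, ``MyList`` and ``Plain`` are hypothetical names.

```python
class Logged:
    """Data descriptor that records which hooks were invoked."""

    def __init__(self):
        self.calls = []

    def __get__(self, obj, objtype=None):
        self.calls.append("get")
        return 42

    def __set__(self, obj, value):
        self.calls.append("set")

    def __delete__(self, obj):
        # The "delete attribute" hook; __del__ is reserved for finalizers.
        self.calls.append("delete")


class Holder:
    attr = Logged()


h = Holder()
h.attr = 1        # calls Logged.__set__
value = h.attr    # calls Logged.__get__
del h.attr        # calls Logged.__delete__
calls = Holder.__dict__["attr"].calls


class MyList(list):
    pass


# Instances of a subclass of a mutable type are unhashable.
try:
    hash(MyList())
    hashable = True
except TypeError:
    hashable = False


class Plain:
    pass


p = Plain()
p.x = 1
del p.__dict__                 # drops all instance attributes
dict_after_delete = p.__dict__  # a fresh, empty dictionary
```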

Core and builtins
-----------------

- -Qnew now works as documented in PEP 238:  when -Qnew is passed on
  the command line, all occurrences of "/" use true division instead
  of classic division.  See the PEP for details.  Note that "all"
  means all instances in library and 3rd-party modules, as well as in
  your own code.  As the PEP says, -Qnew is intended for use only in
  educational environments with control over the libraries in use.
  Note that test_coercion.py in the standard Python test suite fails
  under -Qnew; this is expected, and won't be repaired until true
  division becomes the default (in the meantime, test_coercion is
  testing the current rules).

- complex() now only allows the first argument to be a string
  argument, and raises TypeError if either the second arg is a string
  or if the second arg is specified when the first is a string.
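
The complex() rules above can be checked with a short sketch.  It assumes a current Python 3 interpreter, where the same restrictions apply; ``try_complex`` is a hypothetical helper that returns either the constructed value or the name of the exception raised.

```python
def try_complex(*args):
    """Return complex(*args), or the exception name if it is rejected."""
    try:
        return complex(*args)
    except TypeError as exc:
        return type(exc).__name__


# A string is accepted only as the sole argument.
parsed = try_complex("1+2j")

# A string as the second argument is rejected.
second_is_string = try_complex(1, "2")

# A second argument is rejected when the first is a string.
second_after_string = try_complex("1", 2)
```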

Extension modules
-----------------

- gc.get_referents was renamed to gc.get_referrers.
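
The new name describes the direction of the lookup: the function reports the objects that refer to its argument.  A minimal sketch, assuming a current CPython 3 interpreter; ``Node``, ``target`` and ``holder`` are illustrative names, not part of the gc module.

```python
import gc


class Node:
    pass


target = Node()
holder = [target]  # a container that refers to target

# get_referrers() returns the objects that directly refer to target.
referrers = gc.get_referrers(target)
found = any(r is holder for r in referrers)
```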

Library
-------

- Functions in the os.spawn() family now release the global interpreter
  lock around calling the platform spawn.  They should always have done
  this, but did not before 2.2c1.  Multithreaded programs calling
  an os.spawn function with P_WAIT will no longer block all Python threads
  until the spawned program completes.  It's possible that some programs
  rely on blocking, although more likely by accident than by design.

- webbrowser defaults to netscape.exe on OS/2 now.

- Tix.ResizeHandle exposes detach_widget, hide, and show.

- The charset alias windows_1252 has been added.

- types.StringTypes is a tuple containing the defined string types;
  usually this will be (str, unicode), but if Python was compiled
  without Unicode support it will be just (str,).

- The pulldom and minidom modules were synchronized to PyXML.

Tools/Demos
-----------

- A new script called Tools/scripts/google.py was added, which fires
  off a search on Google.

Build
-----

- Note that release builds of Python should arrange to define the
  preprocessor symbol NDEBUG on the command line (or equivalent).
  In the 2.2 pre-release series we tried to define this by magic in
  Python.h instead, but it proved to cause problems for extension
  authors.  The Unix, Windows and Mac builds now all define NDEBUG in
  release builds via cmdline (or equivalent) instead.  Ports to
  other platforms should do likewise.

- It is no longer necessary to use --with-suffix when building on a
  case-insensitive file system (such as Mac OS X HFS+). In the build
  directory an extension is used, but not in the installed python.

C API
-----

- New function PyDict_MergeFromSeq2() exposes the builtin dict
  constructor's logic for updating a dictionary from an iterable object
  producing key-value pairs.

- PyArg_ParseTupleAndKeywords() requires that the number of entries in
  the keyword list equal the number of argument specifiers.  This
  wasn't checked correctly, and PyArg_ParseTupleAndKeywords could even
  dump core in some bad cases.  This has been repaired.  As a result,
  PyArg_ParseTupleAndKeywords may raise RuntimeError in bad cases that
  previously went unchallenged.

New platforms
-------------

Tests
-----

Windows
-------

Mac
----

- In unix-Python on Mac OS X (and darwin) sys.platform is now "darwin",
  without any trailing digits.

- Changed logic for finding python home in Mac OS X framework Pythons.
  Now sys.executable points to the executable again, instead of to
  the shared library. The latter is used only for locating the python
  home.


What's New in Python 2.2b2?
===========================

*Release date: 16-Nov-2001*

Type/class unification and new-style classes
--------------------------------------------

- Multiple inheritance mixing new-style and classic classes in the
  list of base classes is now allowed, so this works now:

      class Classic: pass
      class Mixed(Classic, object): pass

  The MRO (method resolution order) for each base class is respected
  according to its kind, but the MRO for the derived class is computed
  using new-style MRO rules if any base class is a new-style class.
  This needs to be documented.

- The new builtin dictionary() constructor, and dictionary type, have
  been renamed to dict.  This reflects a decade of common usage.

- dict() now accepts an iterable object producing 2-sequences.  For
  example, dict(d.items()) == d for any dictionary d.  The argument,
  and the elements of the argument, can be any iterable objects.

- New-style classes can now have a __del__ method, which is called
  when the instance is deleted (just like for classic classes).

- Assignment to object.__dict__ is now possible, for objects that are
  instances of new-style classes that have a __dict__ (unless the base
  class forbids it).

- Methods of built-in types now properly check for keyword arguments
  (formerly these were silently ignored).  The only built-in methods
  that take keyword arguments are __call__, __init__ and __new__.

- The socket function has been converted to a type; see below.
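
Two of the entries above, dict() accepting an iterable of 2-sequences and built-in methods rejecting unexpected keyword arguments, can be sketched as follows.  The sketch assumes a current Python 3 interpreter, where both behaviours still hold; ``pairs`` and ``pair_gen`` are illustrative names.

```python
# Each element may be any iterable that yields exactly two items:
# here a tuple, a list and a two-character string.
pairs = [("a", 1), ["b", 2], "cd"]
built = dict(pairs)

# The round trip described in the entry above.
d = {"x": 1, "y": 2}
round_trip = dict(d.items()) == d


def pair_gen():
    """The outer argument and its elements can both be plain iterators."""
    yield iter(["k", "v"])


from_gen = dict(pair_gen())

# Built-in methods that take no keyword arguments reject them
# instead of silently ignoring them.
try:
    [].append(item=1)
    keyword_rejected = False
except TypeError:
    keyword_rejected = True
```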

Core and builtins
-----------------

- Assignment to __debug__ raises SyntaxError at compile-time.  This
  was promised when 2.1c1 was released, as "What's New in Python 2.1c1"
  (see below) says.

- Clarified the error messages for unsupported operands to an operator
  (like 1 + '').
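
Because the ``__debug__`` check happens at compile time, it can be observed without running the offending statement.  A minimal sketch, assuming a current Python 3 interpreter where assignment to ``__debug__`` is still a SyntaxError; ``compiles`` is a hypothetical helper name.

```python
def compiles(source):
    """Return True if source compiles, False if it is a SyntaxError."""
    try:
        compile(source, "<example>", "exec")
        return True
    except SyntaxError:
        return False


# Assigning to __debug__ is rejected by the compiler.
assign_ok = compiles("__debug__ = 1")

# Merely reading __debug__ is fine.
read_ok = compiles("x = __debug__")
```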

Extension modules
-----------------

- mmap has a new keyword argument, "access", allowing a uniform way for
  both Windows and Unix users to create read-only, write-through and
  copy-on-write memory mappings.  This was previously possible only on
  Unix.  A new keyword argument was required to support this in a
  uniform way because the mmap() signatures had diverged across
  platforms.  Thanks to Jay T Miller for repairing this!

- By default, the gc.garbage list now contains only those instances in
  unreachable cycles that have __del__ methods; in 2.1 it contained all
  instances in unreachable cycles.  "Instances" here has been generalized
  to include instances of both new-style and old-style classes.

- The socket module defines a new method for socket objects,
  sendall().  This is like send() but may make multiple calls to
  send() until all data has been sent.  Also, the socket function has
  been converted to a subclassable type, like list and tuple (etc.)
  before it; socket and SocketType are now the same thing.

- Various bugfixes to the curses module.  There is now a test suite
  for the curses module (you have to run it manually).

- binascii.b2a_base64 no longer places an arbitrary restriction of 57
  bytes on its input.
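
The mmap "access" keyword described above can be sketched as follows.  The example is written for a current Python 3 interpreter (it uses bytes and the context-manager form of mmap objects, neither of which existed in 2.2); the temporary file and the variable names are illustrative.

```python
import mmap
import os
import tempfile

fd, path = tempfile.mkstemp()
try:
    with os.fdopen(fd, "wb") as f:
        f.write(b"hello world")

    with open(path, "rb") as f:
        # Read-only mapping: attempts to modify it raise TypeError.
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as ro:
            first_word = ro[:5]
            try:
                ro[0:1] = b"H"
                read_only_enforced = False
            except TypeError:
                read_only_enforced = True

    with open(path, "r+b") as f:
        # Copy-on-write mapping: changes stay in memory only.
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_COPY) as cow:
            cow[0:1] = b"H"
            in_memory = cow[:5]

    # The file itself is untouched by the copy-on-write change.
    with open(path, "rb") as f:
        on_disk = f.read()
finally:
    os.remove(path)
```

The same ``access=`` spelling is used on Windows and Unix, which is the point of the new keyword; ``mmap.ACCESS_WRITE`` selects the write-through variant.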

Library
-------

- tkFileDialog exposes a Directory class and askdirectory
  convenience function.

- Symbolic group names in regular expressions must be unique.  For
  example, the regexp r'(?P<abc>)(?P<abc>)' is not allowed, because a
  single name can't mean both "group 1" and "group 2" simultaneously.
  Python 2.2 detects this error at regexp compilation time;
  previously, the error went undetected, and results were
  unpredictable.  Also in sre, the pattern.split(), pattern.sub(), and
  pattern.subn() methods have been rewritten in C.  Also, an
  experimental function/method finditer() has been added, which works
  like findall() but returns an iterator.

- Tix exposes more commands through the classes DirSelectBox,
  DirSelectDialog, ListNoteBook, Meter, CheckList, and the
  methods tix_addbitmapdir, tix_cget, tix_configure, tix_filedialog,
  tix_getbitmap, tix_getimage, tix_option_get, and tix_resetoptions.

- Traceback objects are now scanned by cyclic garbage collection, so
  cycles created by casual use of sys.exc_info() no longer cause
  permanent memory leaks (provided garbage collection is enabled).

- os.extsep -- a new variable needed by the RISCOS support.  It is the
  separator used by extensions, and is '.' on all platforms except
  RISCOS, where it is '/'.  There is no need to use this variable
  unless you have a masochistic desire to port your code to RISCOS.

- mimetypes.py has optional support for non-standard, but commonly
  found types.  guess_type() and guess_extension() now accept an
  optional 'strict' flag, defaulting to true, which controls whether to
  recognize non-standard types or not.  A few non-standard types we
  know about have been added.  Also, when run as a script, there are
  new -l and -e options.

- statcache is now deprecated.

- email.Utils.formatdate() now produces the preferred RFC 2822 style
  dates with numeric timezones (it used to produce obsolete dates
  hard coded to "GMT" timezone).  An optional 'localtime' flag is
  added to produce dates in the local timezone, with daylight savings
  time properly taken into account.

- In pickle and cPickle, instead of masking errors in load() by
  transforming them into SystemError, we let the original exception
  propagate out.  Also, implement support for __safe_for_unpickling__
  in pickle, as it already was supported in cPickle.
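
The regular expression entry above (unique symbolic group names, and the new finditer()) can be illustrated with a short sketch.  It uses the ``re`` module of a current Python 3 interpreter, where both behaviours are still present; the sample pattern and text are illustrative.

```python
import re

# Reusing a symbolic group name is detected when the pattern is compiled.
try:
    re.compile(r"(?P<abc>)(?P<abc>)")
    duplicate_rejected = False
except re.error:
    duplicate_rejected = True

# finditer() yields match objects lazily; findall() builds a list of strings.
pattern = re.compile(r"\d+")
matches = [m.group() for m in pattern.finditer("a1b22c333")]
same_as_findall = matches == pattern.findall("a1b22c333")
```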

Tools/Demos
-----------

Build
-----

- The dbm module is built using libdb1 if available.  The bsddb module
  is built with libdb3 if available.

- Misc/Makefile.pre.in has been removed by BDFL pronouncement.

C API
-----

- New function PySequence_Fast_GET_SIZE() returns the size of a non-
  NULL result from PySequence_Fast(), more quickly than calling
  PySequence_Size().

- New argument unpacking function PyArg_UnpackTuple() added.

- New functions PyObject_CallFunctionObjArgs() and
  PyObject_CallMethodObjArgs() have been added to make it more
  convenient and efficient to call functions and methods from C.

- PyArg_ParseTupleAndKeywords() no longer masks errors, so it's
  possible that this will propagate errors it didn't before.

- New function PyObject_CheckReadBuffer(), which returns true if its
  argument supports the single-segment readable buffer interface.

New platforms
-------------

- We've finally confirmed that this release builds on HP-UX 11.00,
  *with* threads, and passes the test suite.

- Thanks to a series of patches from Michael Muller, Python may build
  again under OS/2 Visual Age C++.

- Updated RISCOS port by Dietmar Schwertberger.

Tests
-----

- Added a test script for the curses module.  It isn't run automatically;
  regrtest.py must be run with '-u curses' to enable it.

Windows
-------

Mac
----

- PythonScript has been moved to unsupported and is slated to be
  removed completely in the next release.

- It should now be possible to build applets that work on both OS9 and
  OSX.

- The core is now linked with CoreServices not Carbon; as a side
  result, default 8bit encoding on OSX is now ASCII.

- Python should now build on OSX 10.1.1.


What's New in Python 2.2b1?
===========================

*Release date: 19-Oct-2001*

Type/class unification and new-style classes
--------------------------------------------

- New-style classes are now always dynamic (except for built-in and
  extension types).  There is no longer a performance penalty, and I
  no longer see another reason to keep this baggage around.  One relic
  remains: the __dict__ of a new-style class is a read-only proxy; you
  must set the class's attribute to modify it.  As a consequence, the
  __defined__ attribute of new-style types no longer exists, for lack
  of need: there is once again only one __dict__ (although in the
  future a __cache__ may be resurrected with a similar function, if I
  can prove that it actually speeds things up).

- C.__doc__ now works as expected for new-style classes (in 2.2a4 it
  always returned None, even when there was a class docstring).

- doctest now finds and runs docstrings attached to new-style classes,
  class methods, static methods, and properties.
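
The read-only class __dict__ mentioned in the first item of this
section can be illustrated with a small sketch.  The Config class and
its attributes are hypothetical names, and the snippet assumes a
current Python 3 interpreter, where every class is new-style.

```python
class Config(object):
    retries = 3

# Writing through the proxy is rejected...
try:
    Config.__dict__["retries"] = 5
    proxy_write_allowed = True
except TypeError:
    proxy_write_allowed = False

# ...but setting the attribute on the class itself works, and the
# change is visible through the proxy afterwards.
Config.retries = 5
setattr(Config, "timeout", 30)

print(proxy_write_allowed, Config.__dict__["retries"])
```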

Core and builtins
-----------------

- A very subtle syntactical pitfall in list comprehensions was fixed.
  For example: [a+b for a in 'abc', for b in 'def'].  The comma in
  this example is a mistake.  Previously, this would silently let 'a'
  iterate over the singleton tuple ('abc',), yielding ['abcd', 'abce',
  'abcf'] rather than the intended ['ad', 'ae', 'af', 'bd', 'be',
  'bf', 'cd', 'ce', 'cf'].  Now, this is flagged as a syntax error.
  Note that [a for a in <singleton>] is a convoluted way to say
  [<singleton>] anyway, so it's not like any expressiveness is lost.

- getattr(obj, name, default) now only catches AttributeError, as
  documented, rather than returning the default value for all
  exceptions (which could mask bugs in a __getattr__ hook, for
  example).

- Weak reference objects are now part of the core and offer a C API.
  A bug which could allow a core dump when binary operations involved
  proxy references has been fixed.  weakref.ReferenceError is now a
  built-in exception.

- unicode(obj) now behaves more like str(obj), accepting arbitrary
  objects, and calling a __unicode__ method if it exists.
  unicode(obj, encoding) and unicode(obj, encoding, errors) still
  require an 8-bit string or character buffer argument.

- isinstance() now allows any object as the first argument and a
  class, a type or something with a __bases__ tuple attribute for the
  second argument.  The second argument may also be a tuple of a
  class, type, or something with __bases__, in which case isinstance()
  will return true if the first argument is an instance of any of the
  things contained in the second argument tuple.  E.g.

  isinstance(x, (A, B))

  returns true if x is an instance of A or B.
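
The getattr() item in this section is easiest to see with a
__getattr__ hook that contains a bug.  The sketch below uses a
hypothetical Flaky class and assumes a current Python 3 interpreter;
it shows that the default is returned only for AttributeError, while
any other exception propagates.

```python
class Flaky(object):
    def __getattr__(self, name):
        if name == "broken":
            # Stands in for a programming error inside the hook.
            raise ValueError("bug inside __getattr__")
        raise AttributeError(name)

obj = Flaky()

# A genuinely missing attribute still yields the default.
fallback = getattr(obj, "missing", "default")

# A non-AttributeError exception is no longer swallowed.
try:
    getattr(obj, "broken", "default")
    bug_was_masked = True
except ValueError:
    bug_was_masked = False

print(fallback, bug_was_masked)
```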

Extension modules
-----------------

- thread.start_new_thread() now returns the thread ID (previously None).

- binascii now has two quopri support functions, a2b_qp and b2a_qp.

- readline now supports setting the startup_hook and the
  pre_event_hook, and adds the add_history() function.

- os and posix support chroot(), setgroups() and unsetenv() where
  available.  The stat(), fstat(), statvfs() and fstatvfs() functions
  now return "pseudo-sequences" -- the various fields can now be
  accessed as attributes (e.g. os.stat("/").st_mtime) but for
  backwards compatibility they also behave as a fixed-length sequence.
  Some platform-specific fields (e.g. st_rdev) are only accessible as
  attributes.

- time: localtime(), gmtime() and strptime() now return a
  pseudo-sequence similar to the os.stat() return value, with
  attributes like tm_year etc.

- Decompression objects in the zlib module now accept an optional
  second parameter to decompress() that specifies the maximum amount
  of memory to use for the uncompressed data.

- Optional SSL support in the socket module now exports OpenSSL
  functions RAND_add(), RAND_egd(), and RAND_status().  These calls
  are useful on platforms like Solaris where OpenSSL does not
  automatically seed its PRNG.  Also, the keyfile and certfile
  arguments to socket.ssl() are now optional.

- posixmodule (and by extension, the os module on POSIX platforms) now
  exports O_LARGEFILE, O_DIRECT, O_DIRECTORY, and O_NOFOLLOW.
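
The "pseudo-sequence" results described for os.stat() and the time
functions in this section can be sketched like this.  The snippet
assumes a current Python 3 interpreter and uses a throwaway temporary
file; the point is only that the same field is reachable by attribute
name and by sequence index.

```python
import os
import stat
import tempfile
import time

# Create a small scratch file so that the size is known in advance.
with tempfile.NamedTemporaryFile() as handle:
    handle.write(b"hello")
    handle.flush()
    info = os.stat(handle.name)

size_by_attribute = info.st_size
size_by_index = info[stat.ST_SIZE]   # old tuple-style access
field_count = len(info)              # still a fixed-length sequence

# time.gmtime() behaves the same way.
epoch = time.gmtime(0)
year_by_attribute = epoch.tm_year
year_by_index = epoch[0]

print(size_by_attribute, size_by_index, year_by_attribute)
```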

Library
-------

- doctest now excludes functions and classes not defined by the module
  being tested, thanks to Tim Hochberg.

- HotShot, a new profiler implemented using a C-based callback, has
  been added.  This substantially reduces the overhead of profiling,
  but it is still quite preliminary.  Support modules and
  documentation will be added in upcoming releases (before 2.2 final).

- profile now produces correct output in situations where an exception
  raised in Python is cleared by C code (e.g. hasattr()).  This used
  to cause wrong output, including spurious claims of recursive
  functions and attribution of time spent to the wrong function.

  The code and documentation for the derived OldProfile and HotProfile
  profiling classes were removed.  The code hasn't worked for years (if
  you tried to use them, they raised exceptions).  OldProfile was
  intended to reproduce the behavior of the profiler Python used more
  than 7 years ago, and isn't interesting anymore.  HotProfile was
  intended to provide a faster profiler (but producing less
  information), and that's a worthy goal we intend to meet via a
  different approach (but without losing information).

- Profile.calibrate() has a new implementation that should deliver
  a much better system-specific calibration constant.  The constant can
  now be specified in an instance constructor, or as a Profile class or
  instance variable, instead of by editing profile.py's source code.
  Calibration must still be done manually (see the docs for the profile
  module).

  Note that Profile.calibrate() must be overridden by subclasses.
  Improving the accuracy required exploiting detailed knowledge of
  profiler internals; the earlier method abstracted away the details
  and measured a simplified model instead, but consequently computed
  a constant too small by a factor of 2 on some modern machines.

- quopri's encode and decode methods take an optional header parameter,
  which indicates whether output is intended for the header 'Q'
  encoding.

- The SocketServer.ThreadingMixIn class now closes the request after
  finish_request() returns.  (Not when it errors out though.)

- The nntplib module's NNTP.body() method has grown a 'file' argument
  to allow saving the message body to a file.

- The email package has added a class email.Parser.HeaderParser which
  only parses headers and does not recurse into the message's body.
  Also, the module/class MIMEAudio has been added for representing
  audio data (contributed by Anthony Baxter).

- ftplib should be able to handle files > 2GB.

- ConfigParser.getboolean() now also interprets TRUE, FALSE, YES, NO,
  ON, and OFF.

- xml.dom.minidom NodeList objects now support the length attribute
  and item() method as required by the DOM specifications.
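
The NodeList additions in the last item of this section can be tried
with a tiny document.  This is a sketch for a current Python 3
interpreter; the XML snippet and variable names are made up for
illustration.

```python
from xml.dom import minidom

doc = minidom.parseString("<list><item>a</item><item>b</item></list>")
items = doc.getElementsByTagName("item")

count = items.length                      # DOM-style length attribute
first_text = items.item(0).firstChild.data
out_of_range = items.item(5)              # DOM item() gives None, not IndexError

print(count, first_text, out_of_range)
```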

Tools/Demos
-----------

- Demo/dns was removed.  It no longer serves any purpose; a package
  derived from it is now maintained by Anthony Baxter, see
  http://PyDNS.SourceForge.net.

- The freeze tool has been made more robust, and two new options have
  been added: -X and -E.

Build
-----

- configure will use CXX in LINKCC if CXX is used to build main() and
  the system requires linking a C++ main using the C++ compiler.

C API
-----

- The documentation for the tp_compare slot is updated to require that
  the return value must be -1, 0, 1; an arbitrary number <0 or >0 is
  not correct.  This is not yet enforced but will be enforced in
  Python 2.3; even later, we may use -2 to indicate errors and +2 for
  "NotImplemented".  Right now, -1 should be used for an error return.

- PyLong_AsLongLong() now accepts int (as well as long) arguments.
  Consequently, PyArg_ParseTuple's 'L' code also accepts int (as well
  as long) arguments.

- PyThread_start_new_thread() now returns a long int giving the thread
  ID, if one can be calculated; it returns -1 for error, 0 if no
  thread ID is calculated (this is an incompatible change, but only
  the thread module used this API).  This code has only really been
  tested on Linux and Windows; other platforms please beware (and
  report any bugs or strange behavior).

- PyUnicode_FromEncodedObject() no longer accepts Unicode objects as
  input.

New platforms
-------------

Tests
-----

Windows
-------

- Installer:  If you install IDLE, and don't disable file-extension
  registration, a new "Edit with IDLE" context (right-click) menu entry
  is created for .py and .pyw files.

- The signal module now supports SIGBREAK on Windows, thanks to Steven
  Scott.  Note that SIGBREAK is unique to Windows.  The default SIGBREAK
  action remains to call Win32 ExitProcess().  This can be changed via
  signal.signal().  For example::

      # Make Ctrl+Break raise KeyboardInterrupt, like Python's default Ctrl+C
      # (SIGINT) behavior.
      import signal
      signal.signal(signal.SIGBREAK, signal.default_int_handler)

      try:
          while 1:
              pass
      except KeyboardInterrupt:
          # We get here on Ctrl+C or Ctrl+Break now; if we had not changed
          # SIGBREAK, only on Ctrl+C (and Ctrl+Break would terminate the
          # program without the possibility for any Python-level cleanup).
          print "Clean exit"


What's New in Python 2.2a4?
===========================

*Release date: 28-Sep-2001*

Type/class unification and new-style classes
--------------------------------------------

- pydoc and inspect are now aware of new-style classes;
  e.g. help(list) at the interactive prompt now shows proper
  documentation for all operations on list objects.

- Applications using Jim Fulton's ExtensionClass module can now safely
  be used with Python 2.2.  In particular, Zope 2.4.1 now works with
  Python 2.2 (as well as with Python 2.1.1).  The Demo/metaclass
  examples also work again.  It is hoped that Gtk and Boost also work
  with 2.2a4 and beyond.  (If you can confirm this, please write
  webmaster@python.org; if there are still problems, please open a bug
  report on SourceForge.)

- property() now takes 4 keyword arguments:  fget, fset, fdel and doc.
  These map to read-only attributes 'fget', 'fset', 'fdel', and '__doc__'
  in the constructed property object.  fget, fset and fdel weren't
  discoverable from Python in 2.2a3.  __doc__ is new, and allows a
  docstring to be associated with a property.

- Comparison overloading is now more completely implemented.  For
  example, a str subclass instance can properly be compared to a str
  instance, and it can properly overload comparison.  Ditto for most
  other built-in object types.

- The repr() of new-style classes has changed; instead of <type
  'M.Foo'> a new-style class is now rendered as <class 'M.Foo'>,
  *except* for built-in types, which are still rendered as <type
  'Foo'> (to avoid upsetting existing code that might parse or
  otherwise rely on repr() of certain type objects).

- The repr() of new-style objects is now always <Foo object at XXX>;
  previously, it was sometimes <Foo instance at XXX>.

- For new-style classes, what was previously called __getattr__ is now
  called __getattribute__.  This method, if defined, is called for
  *every* attribute access.  A new __getattr__ hook more similar to the
  one in classic classes is defined which is called only if regular
  attribute access raises AttributeError; to catch *all* attribute
  access, you can use __getattribute__ (for new-style classes).  If
  both are defined, __getattribute__ is called first, and if it raises
  AttributeError, __getattr__ is called.

- The __class__ attribute of new-style objects can be assigned to.
  The new class must have the same C-level object layout as the old
  class.

- The builtin file type can be subclassed now.  In the usual pattern,
  "file" is the name of the builtin type, and file() is a new builtin
  constructor, with the same signature as the builtin open() function.
  file() is now the preferred way to open a file.

- Previously, __new__ would only see sequential arguments passed to
  the type in a constructor call; __init__ would see both sequential
  and keyword arguments.  This made no sense whatsoever any more, so
  now both __new__ and __init__ see all arguments.

- Previously, hash() applied to an instance of a subclass of str or
  unicode always returned 0.  This has been repaired.

- Previously, an operation on an instance of a subclass of an
  immutable type (int, long, float, complex, tuple, str, unicode),
  where the subtype didn't override the operation (and so the
  operation was handled by the builtin type), could return that
  instance instead of a value of the base type.  For example, if s was of
  a str subclass type, s[:] returned s as-is.  Now it returns a str
  with the same value as s.

- Provisional support for pickling new-style objects has been added.

Core
----

- file.writelines() now accepts any iterable object producing strings.

- PyUnicode_FromEncodedObject() now works very much like
  PyObject_Str(obj) in that it tries to use __str__/tp_str
  on the object if the object is not a string or buffer. This
  makes unicode() behave like str() when applied to non-string/buffer
  objects.

- PyFile_WriteObject now passes Unicode objects to the file's write
  method. As a result, all file-like objects which may be the target
  of a print statement must support Unicode objects, i.e. they must
  at least convert them into ASCII strings.

- Thread scheduling on Solaris should be improved; it is no longer
  necessary to insert a small sleep at the start of a thread in order
  to let other runnable threads be scheduled.
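
As a small illustration of the writelines() item at the top of this
section, the sketch below feeds a generator to a file object.  It
assumes a current Python 3 interpreter; the numbered() helper is a
hypothetical name.

```python
import tempfile

def numbered(lines):
    """Yield each line prefixed with its 1-based position."""
    for number, text in enumerate(lines, 1):
        yield "%d: %s\n" % (number, text)

with tempfile.TemporaryFile("w+") as handle:
    # No intermediate list is needed; any iterable of strings will do.
    handle.writelines(numbered(["alpha", "beta"]))
    handle.seek(0)
    written = handle.read()

print(written)
```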

Library
-------

- StringIO.StringIO instances and cStringIO.StringIO instances support
  read character buffer compatible objects for their .write() methods.
  These objects are converted to strings and then handled as such
  by the instances.

- The "email" package has been added.  This is basically a port of the
  mimelib package <http://sf.net/projects/mimelib> with API changes
  and some implementations updated to use iterators and generators.

- difflib.ndiff() and difflib.Differ.compare() are generators now.  This
  restores the ability of Tools/scripts/ndiff.py to start producing output
  before the entire comparison is complete.

- StringIO.StringIO instances and cStringIO.StringIO instances support
  iteration just like file objects (i.e. their .readline() method is
  called for each iteration until it returns an empty string).

- The codecs module has grown four new helper APIs to access
  builtin codecs: getencoder(), getdecoder(), getreader(),
  getwriter().

- SimpleXMLRPCServer: a new module (based upon SimpleHTMLServer)
  simplifies writing XML RPC servers.

- os.path.realpath(): a new function that returns the absolute pathname
  after interpretation of symbolic links.  On non-Unix systems, this
  is an alias for os.path.abspath().

- operator.indexOf() (PySequence_Index() in the C API) now works with any
  iterable object.

- smtplib now supports various authentication and security features of
  the SMTP protocol through the new login() and starttls() methods.

- hmac: a new module implementing keyed hashing for message
  authentication.

- mimetypes now recognizes more extensions and file types.  At the
  same time, some mappings not sanctioned by IANA were removed.

- The "compiler" package has been brought up to date to the state of
  Python 2.2 bytecode generation.  It has also been promoted from a
  Tool to a standard library package.  (Tools/compiler still exists as
  a sample driver.)
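
The difflib item in this section says ndiff() is now a generator; a
quick way to see what that buys is to pull the first line of output
before the rest has been computed.  This sketch assumes a current
Python 3 interpreter and uses made-up input lines.

```python
import difflib

old = ["one\n", "two\n", "three\n"]
new = ["one\n", "too\n", "three\n"]

delta = difflib.ndiff(old, new)

# Output is produced lazily: the first line is available right away.
first = next(delta)

# The remaining lines are only generated when they are asked for.
rest = list(delta)

print(repr(first))
print("".join(rest))
```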

Build
-----

- Large file support (LFS) is now automatic when the platform supports
  it; no more manual configuration tweaks are needed.  On Linux, at
  least, it's possible to have a system whose C library supports large
  files but whose kernel doesn't; in this case, large file support is
  still enabled but doesn't do you any good unless you upgrade your
  kernel or share your Python executable with another system whose
  kernel has large file support.

- The configure script now supplies plausible defaults in a
  cross-compilation environment.  This doesn't mean that the supplied
  values are always correct, or that cross-compilation now works
  flawlessly -- but it's a first step (and it shuts up most of
  autoconf's warnings about AC_TRY_RUN).

- The Unix build is now a bit less chatty, courtesy of the parser
  generator.  The build is completely silent (except for errors) when
  using "make -s", thanks to a -q option to setup.py.

C API
-----

- The "structmember" API now supports some new flag bits to deny read
  and/or write access to attributes in restricted execution mode.

New platforms
-------------

- Compaq's iPAQ handheld, running the "familiar" Linux distribution
  (http://familiar.handhelds.org).

Tests
-----

- The "classic" standard tests, which work by comparing stdout to
  an expected-output file under Lib/test/output/, no longer stop at
  the first mismatch.  Instead the test is run to completion, and a
  variant of ndiff-style comparison is used to report all differences.
  This is much easier to understand than the previous style of reporting.

- The unittest-based standard tests now use regrtest's test_main()
  convention, instead of running as a side-effect of merely being
  imported.  This allows these tests to be run in more natural and
  flexible ways as unittests, outside the regrtest framework.

- regrtest.py is much better integrated with unittest and doctest now,
  especially in regard to reporting errors.

Windows
-------

- Large file support now also works for files > 4GB, on filesystems
  that support it (NTFS under Windows 2000).  See "What's New in
  Python 2.2a3" for more detail.


What's New in Python 2.2a3?
===========================

*Release Date: 07-Sep-2001*

Core
----

- Conversion of long to float now raises OverflowError if the long is too
  big to represent as a C double.

- The 3-argument builtin pow() no longer allows a third non-None argument
  if either of the first two arguments is a float, or if both are of
  integer types and the second argument is negative (in which latter case
  the arguments are converted to float, so this is really the same
  restriction).

- The builtin dir() now returns more information, and sometimes much
  more, generally naming all attributes of an object, and all attributes
  reachable from the object via its class, and from its class's base
  classes, and so on from them too.  Example:  in 2.2a2, dir([]) returned
  an empty list.  In 2.2a3,

  >>> dir([])
  ['__add__', '__class__', '__contains__', '__delattr__', '__delitem__',
   '__eq__', '__ge__', '__getattr__', '__getitem__', '__getslice__',
   '__gt__', '__hash__', '__iadd__', '__imul__', '__init__', '__le__',
   '__len__', '__lt__', '__mul__', '__ne__', '__new__', '__repr__',
   '__rmul__', '__setattr__', '__setitem__', '__setslice__', '__str__',
   'append', 'count', 'extend', 'index', 'insert', 'pop', 'remove',
   'reverse', 'sort']

  dir(module) continues to return only the module's attributes, though.

- Overflowing operations on plain ints now return a long int rather
  than raising OverflowError.  This is a partial implementation of PEP
  237.  You can use -Wdefault::OverflowWarning to enable a warning for
  this situation, and -Werror::OverflowWarning to revert to the old
  OverflowError exception.

- A new command line option, -Q<arg>, is added to control run-time
  warnings for the use of classic division.  (See PEP 238.)  Possible
  values are -Qold, -Qwarn, -Qwarnall, and -Qnew.  The default is
  -Qold, meaning the / operator has its classic meaning and no
  warnings are issued.  Using -Qwarn issues a run-time warning about
  all uses of classic division for int and long arguments; -Qwarnall
  also warns about classic division for float and complex arguments
  (for use with fixdiv.py).
  [Note:  the remainder of this item (preserved below) became
  obsolete in 2.2c1 -- -Qnew has global effect in 2.2] ::

    Using -Qnew is questionable; it turns on new division by default, but
    only in the __main__ module.  You can usefully combine -Qwarn or
    -Qwarnall and -Qnew: this gives the __main__ module new division, and
    warns about classic division everywhere else.

- Many built-in types can now be subclassed.  This applies to int,
  long, float, str, unicode, and tuple.  (The types complex, list and
  dictionary can also be subclassed; this was introduced earlier.)
  Note that restrictions apply when subclassing immutable built-in
  types: you can only affect the value of the instance by overloading
  __new__.  You can add mutable attributes, and the subclass instances
  will have a __dict__ attribute, but you cannot change the "value"
  (as implemented by the base class) of an immutable subclass instance
  once it is created.

- The dictionary constructor now takes an optional argument, a
  mapping-like object, and initializes the dictionary from its
  (key, value) pairs.

- A new built-in type, super, has been added.  This facilitates making
  "cooperative super calls" in a multiple inheritance setting.  For an
  explanation, see http://www.python.org/2.2/descrintro.html#cooperation

- A new built-in type, property, has been added.  This enables the
  creation of "properties".  These are attributes implemented by
  getter and setter functions (or only one of these for read-only or
  write-only attributes), without the need to override __getattr__.
  See http://www.python.org/2.2/descrintro.html#property

- The syntax of floating-point and imaginary literals has been
  liberalized, to allow leading zeroes.  Examples of literals now
  legal that were SyntaxErrors before:

      00.0    0e3   0100j   07.5   00000000000000000008.

- An old tokenizer bug allowed floating point literals with an incomplete
  exponent, such as 1e and 3.1e-.  Such literals now raise SyntaxError.
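
Two items in this section, subclassing immutable built-in types and
the new property type, can be sketched together.  The Celsius class,
its label attribute and the fahrenheit property are hypothetical
names, and the snippet assumes a current Python 3 interpreter.

```python
class Celsius(float):
    """A float subclass whose numeric value is fixed in __new__."""

    def __new__(cls, degrees, label="unlabelled"):
        # The float value can only be chosen here; by the time
        # __init__ would run, the immutable value already exists.
        self = super(Celsius, cls).__new__(cls, round(degrees, 1))
        # Extra mutable state lives in the instance __dict__.
        self.label = label
        return self

    def _get_fahrenheit(self):
        return float(self) * 9.0 / 5.0 + 32.0

    # Getter only, so the attribute is read-only.
    fahrenheit = property(_get_fahrenheit, doc="Temperature in Fahrenheit.")

t = Celsius(20.04, label="office")

try:
    t.fahrenheit = 0.0
    read_only = False
except AttributeError:
    read_only = True

print(t, t.label, t.fahrenheit, read_only)
```

The value is rounded inside __new__ purely to show that the subclass
controls what the base type stores.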

Library
-------

- telnetlib includes symbolic names for the options, and support for
  setting an option negotiation callback. It also supports processing
  of suboptions.

- The new C standard no longer requires that math libraries set errno to
  ERANGE on overflow.  For platform libraries that exploit this new
  freedom, Python's overflow-checking was wholly broken.  A new overflow-
  checking scheme attempts to repair that, but may not be reliable on all
  platforms (C doesn't seem to provide anything both useful and portable
  in this area anymore).

- Asynchronous timeout actions are available through the new class
  threading.Timer.

- math.log and math.log10 now return sensible results for even huge
  long arguments.  For example, math.log10(10 ** 10000) ~= 10000.0.

- A new function, imp.lock_held(), returns 1 when the import lock is
  currently held.  See the docs for the imp module.

- pickle, cPickle and marshal on 32-bit platforms can now correctly read
  dumps containing ints written on platforms where Python ints are 8 bytes.
  When read on a box where Python ints are 4 bytes, such values are
  converted to Python longs.

- In restricted execution mode (using the rexec module), unmarshalling
  code objects is no longer allowed.  This plugs a security hole.

- unittest.TestResult instances no longer store references to tracebacks
  generated by test failures. This prevents unexpected dangling references
  to objects that should be garbage collected between tests.
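
The threading.Timer class mentioned in this section can be exercised
as follows.  This is a minimal sketch for a current Python 3
interpreter; the delays and Event objects are arbitrary choices made
for illustration.

```python
import threading

fired = threading.Event()

# Run fired.set() roughly 0.05 seconds from now, on a separate thread.
timer = threading.Timer(0.05, fired.set)
timer.start()
timer.join()   # Timer is a Thread subclass, so join() waits for it

# A timer can be cancelled as long as its action has not started yet.
never = threading.Event()
cancelled = threading.Timer(60.0, never.set)
cancelled.start()
cancelled.cancel()
cancelled.join()

print(fired.is_set(), never.is_set())
```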

Tools
-----

- Tools/scripts/fixdiv.py has been added which can be used to fix
  division operators as per PEP 238.

Build
-----

- If you are an adventurous person using Mac OS X you may want to look at
  Mac/OSX. There is a Makefile there that will build Python as a real Mac
  application, which can be used for experimenting with Carbon or Cocoa.
  Discussion of this on pythonmac-sig, please.

C API
-----

- New function PyObject_Dir(obj), like Python __builtin__.dir(obj).

- Note that PyLong_AsDouble can fail!  This has always been true, but no
  callers checked for it.  It's more likely to fail now, because overflow
  errors are properly detected now.  The proper way to check::

      double x = PyLong_AsDouble(some_long_object);
      if (x == -1.0 && PyErr_Occurred()) {
              /* The conversion failed. */
      }

- The GC API has been changed.  Extensions that use the old API will still
  compile but will not participate in GC.  To upgrade an extension
  module:

    - rename Py_TPFLAGS_GC to Py_TPFLAGS_HAVE_GC

    - use PyObject_GC_New or PyObject_GC_NewVar to allocate objects and
      PyObject_GC_Del to deallocate them

    - rename PyObject_GC_Init to PyObject_GC_Track and PyObject_GC_Fini
      to PyObject_GC_UnTrack

    - remove PyGC_HEAD_SIZE from object size calculations

    - remove calls to PyObject_AS_GC and PyObject_FROM_GC

- Two new functions: PyString_FromFormat() and PyString_FromFormatV().
  These can be used safely to construct string objects from a
  sprintf-style format string (similar to the format string supported
  by PyErr_Format()).

New platforms
-------------

- Stephen Hansen contributed patches sufficient to get a clean compile
  under Borland C (Windows), but he reports problems running it and ran
  out of time to complete the port.  Volunteers?  Expect a MemoryError
  when importing the types module; this is probably shallow, and
  causing later failures too.

Tests
-----

Windows
-------

- Large file support is now enabled on Win32 platforms as well as on
  Win64.  This means that, for example, you can use f.tell() and f.seek()
  to manipulate files larger than 2 gigabytes (provided you have enough
  disk space, and are using a Windows filesystem that supports large
  partitions).  Windows filesystem limits:  FAT has a 2GB (gigabyte)
  filesize limit, and large file support makes no difference there.
  FAT32's limit is 4GB, and files >= 2GB are easier to use from Python now.
  NTFS has no practical limit on file size, and files of any size can be
  used from Python now.

- The w9xpopen hack is now used on Windows NT and 2000 too when COMSPEC
  points to command.com (patch from Brian Quinlan).


What's New in Python 2.2a2?
===========================

*Release Date: 22-Aug-2001*

Build
-----

- Tim Peters developed a brand new Windows installer using Wise 8.1,
  generously donated to us by Wise Solutions.

- configure supports a new option --enable-unicode, with the values
  ucs2 and ucs4 (new in 2.2a1). With --disable-unicode, the Unicode
  type and supporting code is completely removed from the interpreter.

- A new configure option --enable-framework builds a Mac OS X framework,
  which "make frameworkinstall" will install. This provides a starting
  point for more Mac-like functionality; join pythonmac-sig@python.org
  if you are interested in helping.

- The NeXT platform is no longer supported.

- The 'new' module is now statically linked.

Tools
-----

- The new Tools/scripts/cleanfuture.py can be used to automatically
  edit out obsolete future statements from Python source code.  See
  the module docstring for details.

Tests
-----

- regrtest.py now knows which tests are expected to be skipped on some
  platforms, allowing it to give clearer test result output.  regrtest
  also has an optional --use/-u switch to run normally disabled tests
  which require network access or consume significant disk resources.

- Several new tests in the standard test suite, with special thanks to
  Nick Mathewson.

Core
----

- The floor division operator // has been added as outlined in PEP
  238.  The / operator still provides classic division (and will until
  Python 3.0) unless "from __future__ import division" is included, in
  which case the / operator will provide true division.  The operator
  module provides truediv() and floordiv() functions.  Augmented
  assignment variants are included, as are the equivalent overloadable
  methods and C API methods.  See the PEP for a full discussion:
  <http://python.sf.net/peps/pep-0238.html>
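
A minimal sketch of the division rules in the entry above, written for a
current Python 3 interpreter (an assumption: there ``/`` is always true
division, so no future statement is needed).  The variable names are
illustrative only.

```python
import operator

# // always floors, rounding toward negative infinity.
floor_pos = 7 // 2
floor_neg = -7 // 2

# operator.truediv() and operator.floordiv() mirror the two operators.
true_quotient = operator.truediv(7, 2)
floor_quotient = operator.floordiv(7, 2)

# Augmented assignment variant of floor division.
total = 17
total //= 5

print(floor_pos, floor_neg, true_quotient, floor_quotient, total)
```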

- Future statements are now effective in simulated interactive shells
  (like IDLE).  This should "just work" by magic, but read Michael
  Hudson's "Future statements in simulated shells" PEP 264 for full
  details:  <http://python.sf.net/peps/pep-0264.html>.

- The type/class unification (PEP 252-253) was integrated into the
  trunk and is not so tentative any more (the exact specification of
  some features is still tentative).  A lot of work has been done on fixing
  bugs and adding robustness and features (performance still has to
  come a long way).

- Warnings about a mismatch in the Python API during extension import
  now use the Python warning framework (which makes it possible to
  write filters for these warnings).

- A function's __dict__ (aka func_dict) will now always be a
  dictionary.  It used to be possible to delete it or set it to None,
  but now both actions raise TypeErrors.  It is still legal to set it
  to a dictionary object.  Getting func.__dict__ before any attributes
  have been assigned now returns an empty dictionary instead of None.
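
The ``__dict__`` rules in the entry above can be checked with a short
sketch.  It assumes a current Python 3 interpreter, where the same
behaviour still holds; ``sample`` and ``errors`` are hypothetical names.

```python
def sample():
    pass

# Before any attribute is assigned, __dict__ is an empty dictionary.
initial = dict(sample.__dict__)

errors = []
try:
    sample.__dict__ = None          # setting it to None is rejected
except TypeError:
    errors.append("set None")
try:
    del sample.__dict__             # deleting it is rejected as well
except TypeError:
    errors.append("delete")

# Replacing it with a real dictionary is still legal.
sample.__dict__ = {"tag": "ok"}
print(initial, errors, sample.tag)
```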

- A new command line option, -E, was added which disables the use of
  all environment variables, or at least those that are specifically
  significant to Python.  Usually those have a name starting with
  "PYTHON".  This was used to fix a problem where the tests fail if
  the user happens to have PYTHONHOME or PYTHONPATH pointing to an
  older distribution.

Library
-------

- New class Differ and new functions ndiff() and restore() in difflib.py.
  These package the algorithms used by the popular Tools/scripts/ndiff.py,
  for programmatic reuse.
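
As an illustration of the difflib entry above, the sketch below builds a
delta with ``ndiff()`` and recovers both inputs with ``restore()``.  The
sample lines are made up for the example.

```python
import difflib

old = ["one\n", "two\n", "three\n"]
new = ["one\n", "tree\n", "emu\n"]

# ndiff() yields a human-readable delta, one line at a time.
delta = list(difflib.ndiff(old, new))

# restore() extracts either original sequence from that delta:
# 1 selects the first input, 2 selects the second.
recovered_old = list(difflib.restore(delta, 1))
recovered_new = list(difflib.restore(delta, 2))

print("".join(delta))
```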

- New function xml.sax.saxutils.quoteattr():  Quote an XML attribute
  value using the minimal quoting required for the value; more
  reliable than using xml.sax.saxutils.escape() for attribute values.
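
A small sketch of why ``quoteattr()`` is the safer choice for attribute
values: ``escape()`` leaves quote characters alone, while ``quoteattr()``
also picks a delimiter that does not clash with the value.  The sample
value is hypothetical.

```python
from xml.sax.saxutils import escape, quoteattr

value = 'Tom & "Jerry"'

# escape() handles &, < and > but does not add or protect quotes.
escaped = escape(value)

# quoteattr() escapes and wraps the value in a suitable quote character.
quoted = quoteattr(value)

element = "<show title=%s/>" % quoted
print(element)
```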

- Readline completion support for cmd.Cmd was added.

- Calling os.tempnam() or os.tmpnam() generates a RuntimeWarning.

- Added function threading.BoundedSemaphore()
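
A minimal sketch of what ``threading.BoundedSemaphore()`` adds over a
plain semaphore, assuming a current Python 3 interpreter: releasing more
often than acquiring is reported as an error.  ``slots`` is a
hypothetical name.

```python
import threading

slots = threading.BoundedSemaphore(2)

slots.acquire()
slots.release()          # balanced: back at the initial value

try:
    slots.release()      # one release too many
    over_release_detected = False
except ValueError:
    over_release_detected = True

print(over_release_detected)
```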

- Added Ka-Ping Yee's cgitb.py module.

- The 'new' module now exposes the CO_xxx flags.

- The gc module offers the get_referents function.

New platforms
-------------

C API
-----

- Two new APIs PyOS_snprintf() and PyOS_vsnprintf() were added
  which provide cross-platform implementations for the
  relatively new snprintf()/vsnprintf() C lib APIs. In contrast to
  the standard sprintf() and vsprintf() C lib APIs, these versions
  apply bounds checking on the used buffer which enhances protection
  against buffer overruns.

- Unicode APIs now use name mangling to assure that mixing interpreters
  and extensions using different Unicode widths is rendered next to
  impossible. Trying to import an incompatible Unicode-aware extension
  will result in an ImportError.  Unicode extension writers must make
  sure to check the Unicode width compatibility in their extensions by
  using at least one of the mangled Unicode APIs in the extension.

- Two new flags METH_NOARGS and METH_O are available in method definition
  tables to simplify implementation of methods with no arguments and a
  single untyped argument. Calling such methods is more efficient than
  calling corresponding METH_VARARGS methods. METH_OLDARGS is now
  deprecated.

Windows
-------

- "import module" now compiles module.pyw if it exists and nothing else
  relevant is found.


What's New in Python 2.2a1?
===========================

*Release date: 18-Jul-2001*

Core
----

- TENTATIVELY, a large amount of code implementing much of what's
  described in PEP 252 (Making Types Look More Like Classes) and PEP
  253 (Subtyping Built-in Types) was added.  This will be released
  with Python 2.2a1.  Documentation will be provided separately
  through http://www.python.org/2.2/.  The purpose of releasing this
  with Python 2.2a1 is to test backwards compatibility.  It is
  possible, though not likely, that a decision is made not to release
  this code as part of 2.2 final, if any serious backwards
  incompatibilities are found during alpha testing that cannot be
  repaired.

- Generators were added; this is a new way to create an iterator (see
  below) using what looks like a simple function containing one or
  more 'yield' statements.  See PEP 255.  Since this adds a new
  keyword to the language, this feature must be enabled by including a
  future statement: "from __future__ import generators" (see PEP 236).
  Generators will become a standard feature in a future release
  (probably 2.3).  Without this future statement, 'yield' remains an
  ordinary identifier, but a warning is issued each time it is used.
  (These warnings currently don't conform to the warnings framework of
  PEP 230; we intend to fix this in 2.2a2.)
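
A minimal sketch of a generator as described in the entry above.  It is
written for a current Python 3 interpreter (an assumption), where
``yield`` is always a keyword and the future statement is unnecessary;
``countdown`` is a hypothetical example function.

```python
def countdown(n):
    """Yield n, n-1, ..., 1, suspending after each value."""
    while n > 0:
        yield n
        n -= 1

# Calling the function returns an iterator; nothing runs until it is consumed.
gen = countdown(3)
first = next(gen)
rest = list(gen)

print(first, rest)
```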

- The UTF-16 codec was modified to be more RFC compliant. It will now
  only remove BOM characters at the start of the string and then
  only if running in native mode (UTF-16-LE and -BE won't remove a
  leading BOM character).
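
The BOM handling in the entry above can be sketched as follows, using
bytes literals on a current Python 3 interpreter (an assumption; the
2.2-era spelling used plain strings).

```python
# Little-endian BOM (FF FE) followed by "hi" in UTF-16-LE.
raw = b"\xff\xfeh\x00i\x00"

# The generic codec consumes the leading BOM.
generic = raw.decode("utf-16")

# The explicit little-endian codec leaves it in place as U+FEFF.
explicit = raw.decode("utf-16-le")

print(repr(generic), repr(explicit))
```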

- Strings now have a new method .decode() to complement the already
  existing .encode() method. These two methods provide direct access
  to the corresponding decoders and encoders of the registered codecs.

  To enhance the usability of the .encode() method, the special
  casing of Unicode object return values was dropped (Unicode objects
  were auto-magically converted to string using the default encoding).

  Both methods will now return whatever the codec in charge of the
  requested encoding returns as object, e.g. Unicode codecs will
  return Unicode objects when decoding is requested ("äöü".decode("latin-1")
  will return u"äöü"). This enables codec writers to create codecs
  for various simple-to-use conversions.

  New codecs were added to demonstrate these new features (the .encode()
  and .decode() columns indicate the type of the returned objects):

  +---------+-----------+-----------+-----------------------------+
  |Name     | .encode() | .decode() | Description                 |
  +=========+===========+===========+=============================+
  |uu       | string    | string    | UU codec (e.g. for email)   |
  +---------+-----------+-----------+-----------------------------+
  |base64   | string    | string    | base64 codec                |
  +---------+-----------+-----------+-----------------------------+
  |quopri   | string    | string    | quoted-printable codec      |
  +---------+-----------+-----------+-----------------------------+
  |zlib     | string    | string    | zlib compression            |
  +---------+-----------+-----------+-----------------------------+
  |hex      | string    | string    | 2-byte hex codec            |
  +---------+-----------+-----------+-----------------------------+
  |rot-13   | string    | Unicode   | ROT-13 Unicode charmap codec|
  +---------+-----------+-----------+-----------------------------+
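
A rough sketch of three of the codecs in the table above.  It assumes a
current Python 3 interpreter, where these conversions are reached through
``codecs.encode()`` and ``codecs.decode()`` rather than the string methods
described in this entry, and where the binary codecs work on bytes.

```python
import codecs

# hex and base64 map bytes to bytes.
hex_text = codecs.encode(b"hi", "hex")
hex_back = codecs.decode(hex_text, "hex")
b64_text = codecs.encode(b"hi", "base64")

# rot_13 maps text to text.
rotated = codecs.encode("Hello", "rot_13")
unrotated = codecs.decode(rotated, "rot_13")

print(hex_text, b64_text, rotated)
```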

- Some operating systems now support the concept of a default Unicode
  encoding for file system operations.  Notably, Windows supports 'mbcs'
  as the default.  The Macintosh will also adopt this concept in the medium
  term, although the default encoding for that platform will be other than
  'mbcs'.

  On operating systems that support non-ASCII filenames, it is common for
  functions that return filenames (such as os.listdir()) to return Python
  string objects pre-encoded using the default file system encoding for
  the platform.  As this encoding is likely to be different from Python's
  default encoding, converting this name to a Unicode object before passing
  it back to the Operating System would result in a Unicode error, as Python
  would attempt to use its default encoding (generally ASCII) rather than
  the default encoding for the file system.

  In general, this change simply removes surprises when working with
  Unicode and the file system, making these operations work as you expect,
  increasing the transparency of Unicode objects in this context.
  See [????] for more details, including examples.

- Float (and complex) literals in source code were evaluated to full
  precision only when running from a .py file; the same code loaded from a
  .pyc (or .pyo) file could suffer numeric differences starting at about the
  12th significant decimal digit.  For example, on a machine with IEEE-754
  floating arithmetic,

      x = 9007199254740992.0
      print long(x)

  printed 9007199254740992 if run directly from .py, but 9007199254740000
  if from a compiled (.pyc or .pyo) file.  This was due to marshal using
  str(float) instead of repr(float) when building code objects.  marshal
  now uses repr(float) instead, which should reproduce floats to full
  machine precision (assuming the platform C float<->string I/O conversion
  functions are of good quality).

  This may cause floating-point results to change in some cases, and
  usually for the better, but may also cause numerically unstable
  algorithms to break.
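
The precision loss described in the entry above can be reproduced with a
short sketch.  It assumes a current Python 3 interpreter and uses the
``%.12g`` format as a stand-in for the 12-significant-digit ``str(float)``
of that era.

```python
import marshal

x = 9007199254740992.0            # 2**53, exactly representable

# Twelve significant digits are not enough to round-trip this value.
short_text = "%.12g" % x
via_short = int(float(short_text))

# repr() keeps enough digits to recover the original float.
via_repr = int(float(repr(x)))

# marshal itself round-trips the float exactly.
via_marshal = marshal.loads(marshal.dumps(x))

print(short_text, via_short, via_repr)
```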

- The implementation of dicts suffers fewer collisions, which has speed
  benefits.  However, the order in which dict entries appear in dict.keys(),
  dict.values() and dict.items() may differ from previous releases for a
  given dict.  Nothing is defined about this order, so no program should
  rely on it.  Nevertheless, it's easy to write test cases that rely on the
  order by accident, typically because of printing the str() or repr() of a
  dict to an "expected results" file.  See Lib/test/test_support.py's new
  sortdict(dict) function for a simple way to display a dict in sorted
  order.
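
A helper in the spirit of the ``sortdict()`` function mentioned above
might look like the sketch below.  This is an illustrative
reimplementation under that name, not the code shipped in
Lib/test/test_support.py.

```python
def sortdict(mapping):
    """Return a repr-like string with the items in sorted key order."""
    pairs = sorted(mapping.items())
    return "{%s}" % ", ".join("%r: %r" % pair for pair in pairs)

# The output no longer depends on the dict's internal ordering.
shown = sortdict({"pear": 3, "apple": 1, "fig": 2})
print(shown)
```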

- Many other small changes to dicts were made, resulting in faster
  operation along the most common code paths.

- Dictionary objects now support the "in" operator: "x in dict" means
  the same as dict.has_key(x).

- The update() method of dictionaries now accepts generic mapping
  objects.  Specifically the argument object must support the .keys()
  and __getitem__() methods.  This allows you to say, for example,
  {}.update(UserDict())
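
As a sketch of the mapping protocol that ``update()`` relies on, the
hypothetical ``Squares`` class below provides only ``keys()`` and
``__getitem__()``.  The ``in`` test from the preceding entry is used to
check the result.

```python
class Squares:
    """Minimal mapping-like object: just keys() and __getitem__()."""

    def keys(self):
        return [1, 2, 3]

    def __getitem__(self, key):
        return key * key

d = {}
d.update(Squares())

has_two = 2 in d
has_five = 5 in d
print(d, has_two, has_five)
```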

- Iterators were added; this is a generalized way of providing values
  to a for loop.  See PEP 234.  There's a new built-in function iter()
  to return an iterator.  There's a new protocol to get the next value
  from an iterator using the next() method (in Python) or the
  tp_iternext slot (in C).  There's a new protocol to get iterators
  using the __iter__() method (in Python) or the tp_iter slot (in C).
  Iterating (i.e. a for loop) over a dictionary generates its keys.
  Iterating over a file generates its lines.
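
A minimal sketch of the iterator protocol from the entry above.  It
assumes a current Python 3 interpreter, where the method that fetches the
next value is spelled ``__next__()`` rather than ``next()``; ``Countdown``
is a hypothetical class.

```python
class Countdown:
    def __init__(self, start):
        self.remaining = start

    def __iter__(self):
        # An iterator returns itself from __iter__().
        return self

    def __next__(self):
        # Raising StopIteration signals that the values are exhausted.
        if self.remaining <= 0:
            raise StopIteration
        value = self.remaining
        self.remaining -= 1
        return value

counted = list(Countdown(3))

# Iterating over a dictionary produces its keys.
keys = sorted(iter({"b": 2, "a": 1}))
print(counted, keys)
```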

- The following functions were generalized to work nicely with iterator
  arguments::

    map(), filter(), reduce(), zip()
    list(), tuple() (PySequence_Tuple() and PySequence_Fast() in C API)
    max(), min()
    join() method of strings
    extend() method of lists
    'x in y' and 'x not in y' (PySequence_Contains() in C API)
    operator.countOf() (PySequence_Count() in C API)
    right-hand side of assignment statements with multiple targets, such as ::
        x, y, z = some_iterable_object_returning_exactly_3_values
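
Several of the generalized operations listed above can be exercised with
one small iterator source.  The sketch assumes a current Python 3
interpreter; ``letters`` is a hypothetical generator used only as a
convenient iterable.

```python
def letters():
    yield "a"
    yield "b"
    yield "c"

# Unpacking, join(), max(), 'in', zip() and extend() all accept an iterator.
x, y, z = letters()
joined = "-".join(letters())
largest = max(letters())
found = "b" in letters()
paired = list(zip(letters(), range(3)))

extended = ["start"]
extended.extend(letters())

print(x, y, z, joined, largest, found, paired, extended)
```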

- Accessing module attributes is significantly faster (for example,
  random.random or os.path or yourPythonModule.yourAttribute).

- Comparing dictionary objects via == and != is faster, and now works even
  if the keys and values don't support comparisons other than ==.

- Comparing dictionaries in ways other than == and != is slower:  there were
  insecurities in the dict comparison implementation that could cause Python
  to crash if the element comparison routines for the dict keys and/or
  values mutated the dicts.  Making the code bulletproof slowed it down.

- Collisions in dicts are resolved via a new approach, which can help
  dramatically in bad cases.  For example, looking up every key in a dict
  d with d.keys() == [i << 16 for i in range(20000)] is approximately 500x
  faster now.  Thanks to Christian Tismer for pointing out the cause and
  the nature of an effective cure (last December! better late than never).

- repr() is much faster for large containers (dict, list, tuple).


Library
-------

- The constants ascii_letters, ascii_lowercase, and ascii_uppercase
  were added to the string module.  These are locale-independent
  constants, unlike letters, lowercase, and uppercase.  These are now
  used in appropriate locations in the standard library.
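
A short sketch of the new constants; their values are fixed regardless of
the active locale.  The membership check at the end is only an
illustration of a typical use.

```python
import string

lower = string.ascii_lowercase
upper = string.ascii_uppercase
letters = string.ascii_letters

# Typical use: test characters against a locale-independent alphabet.
is_ascii_word = all(ch in string.ascii_letters for ch in "Python")

print(lower, upper, is_ascii_word)
```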

- The flags used in dlopen calls can now be configured using
  sys.setdlopenflags and queried using sys.getdlopenflags.

- Fredrik Lundh's xmlrpclib is now a standard library module.  This
  provides full client-side XML-RPC support.  In addition,
  Demo/xmlrpc/ contains two server frameworks (one SocketServer-based,
  one asyncore-based).  Thanks to Eric Raymond for the documentation.

- The xrange() object is simplified: it no longer supports slicing,
  repetition, comparisons, efficient 'in' checking, the tolist()
  method, or the start, stop and step attributes.  See PEP 260.

- A new function fnmatch.filter to filter lists of file names was added.
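
For illustration, ``fnmatch.filter()`` applied to a made-up list of file
names:

```python
import fnmatch

names = ["setup.py", "README", "test_dict.py", "notes.txt"]

# Keep only the names that match the shell-style pattern.
python_files = fnmatch.filter(names, "*.py")
print(python_files)
```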

- calendar.py uses month and day names based on the current locale.

- strop is now *really* obsolete (this was announced before with 1.6),
  and issues DeprecationWarning when used (except for the four items
  that are still imported into string.py).

- Cookie.py now sorts key+value pairs by key in output strings.

- pprint.isrecursive(object) didn't correctly identify recursive objects.
  Now it does.
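
A small sketch of what ``pprint.isrecursive()`` reports, using a list
that contains itself; ``loop`` and ``flat`` are hypothetical names.

```python
import pprint

loop = [1, 2]
loop.append(loop)        # the list now refers to itself
flat = [1, [2, 3]]       # nested, but not recursive

loop_is_recursive = pprint.isrecursive(loop)
flat_is_recursive = pprint.isrecursive(flat)

# A recursive structure cannot be rebuilt from its printed form.
loop_is_readable = pprint.isreadable(loop)
flat_is_readable = pprint.isreadable(flat)

print(loop_is_recursive, flat_is_recursive)
```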

- pprint functions are now much faster for large containers (tuple, list, dict).

- New 'q' and 'Q' format codes in the struct module, corresponding to C
  types "long long" and "unsigned long long" (on Windows, __int64).  In
  native mode, these can be used only when the platform C compiler supports
  these types (when HAVE_LONG_LONG is #define'd by the Python config
  process), and then they inherit the sizes and alignments of the C types.
  In standard mode, 'q' and 'Q' are supported on all platforms, and are
  8-byte integral types.
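
A minimal sketch of the 'q' and 'Q' codes in standard mode, selected here
with the ``<`` (little-endian) prefix so that the sizes do not depend on
the platform C compiler.  It assumes a current Python 3 interpreter, where
packed data is a bytes object.

```python
import struct

# In standard mode both codes are always 8 bytes wide.
size = struct.calcsize("<Q")

# Signed 64-bit: -2 in two's complement, least significant byte first.
packed = struct.pack("<q", -2)

# Unsigned 64-bit: eight 0xFF bytes decode to the largest value.
(biggest,) = struct.unpack("<Q", b"\xff" * 8)

print(size, packed, biggest)
```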

- The site module installs a new built-in function 'help' that invokes
  pydoc.help.  It must be invoked as 'help()'; when invoked as 'help',
  it displays a message reminding the user to use 'help()' or
  'help(object)'.

Tests
-----

- New test_mutants.py runs dict comparisons where the key and value
  comparison operators mutate the dicts randomly during comparison.  This
  rapidly causes Python to crash under earlier releases (not for the faint
  of heart:  it can also cause Win9x to freeze or reboot!).

- New test_pprint.py verifies that pprint.isrecursive() and
  pprint.isreadable() return sensible results.  Also verifies that simple
  cases produce correct output.

C API
-----

- Removed the unused last_is_sticky argument from the internal
  _PyTuple_Resize().  If this affects you, you were cheating.

----

**(For information about older versions, consult the HISTORY file.)**