Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Support
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
C
cpython
Project overview
Project overview
Details
Activity
Releases
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Issues
0
Issues
0
List
Boards
Labels
Milestones
Merge Requests
0
Merge Requests
0
Analytics
Analytics
Repository
Value Stream
Wiki
Wiki
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Create a new issue
Commits
Issue Boards
Open sidebar
Kirill Smelkov
cpython
Commits
47ce9655
Commit
47ce9655
authored
Oct 29, 2010
by
Victor Stinner
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
What's New in Python 3.2: update Unicode section
parent
6f35eda4
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
21 additions
and
10 deletions
+21
-10
Doc/whatsnew/3.2.rst
Doc/whatsnew/3.2.rst
+21
-10
No files found.
Doc/whatsnew/3.2.rst
View file @
47ce9655
...
...
@@ -573,13 +573,29 @@ A number of small performance enhancements have been added:
and the Unladen Swallow team in :issue:`9410` and :issue:`3873`.)
Filenames and
Unicode
=======
==============
Unicode
=======
The :mod:`os` module has two new functions: :func:`~os.fsencode` and
:func:`~os.fsdecode`.
:func:`~os.fsdecode`. Add :data:`os.environb`: bytes version of
:data:`os.environ`, :func:`os.getenvb` function and
:data:`os.supports_bytes_environ` constant.
``'mbcs'`` encoding doesn't ignore the error handler argument anymore. By
default (strict mode), it raises an UnicodeDecodeError on undecodable byte
sequence and UnicodeEncodeError on unencodable character. To get the ``'mbcs'``
encoding of Python 3.1, use ``'ignore'`` error handler to decode and
``'replace'`` error handler to encode. ``'mbcs'`` supports ``'strict'`` and
``'ignore'`` error handlers for decoding, and ``'strict'`` and ``'replace'``
for encoding.
On Mac OS X, Python uses ``'utf-8'`` to decode the command line arguments,
instead of the locale encoding (which is ISO-8859-1 if the ``LANG`` environment
variable is not set).
.. XXX mention Victor's improvements for support of undecodable filenames.
By default, tarfile uses ``'utf-8'`` encoding on Windows (instead of
``'mbcs'``), and the ``'surrogateescape'`` error handler on all operating
systems.
.. IDLE
...
...
@@ -642,9 +658,4 @@ require changes to your code:
instead; the new type has a well-defined interface for passing typing safety
information and a less complicated signature for calling a destructor.
* mbcs encoding doesn't ignore the error handler argument anymore. By default
(strict mode), it raises an UnicodeDecodeError on undecodable byte sequence
and UnicodeEncodeError on unencodable character. To get the mbcs encoding of
Python 3.1, use ``'ignore'`` error handler to decode and ``'replace'`` error
handler to encode. mbcs now supports ``'strict'`` and ``'ignore'`` error
handlers for decoding, and ``'strict'`` and ``'replace'`` for encoding.
* Remove sys.setfilesystemencoding() function: it was broken by design.
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment