    Issue #8633: Support for POSIX.1-2008 binary pax headers. · 1465cc28
    Lars Gustäbel authored
    tarfile is now able to read and write pax headers with a
    "hdrcharset=BINARY" record. This record was introduced in
    POSIX.1-2008 as a method to store unencoded binary strings that
    cannot be translated to UTF-8. In practice, this is just a workaround
    that allows a tar implementation to store filenames that do not
    comply with the current filesystem encoding and thus cannot be
    decoded correctly.
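    The behaviour described above can be illustrated with a minimal sketch
    against the `tarfile` module in a current Python 3. The filename
    `caf\xe9.txt` and all variable names are made up for illustration, and
    the sketch assumes the archive encoding is UTF-8 with the
    `surrogateescape` error handler, so that the stray byte 0xE9 survives
    as a lone surrogate in the `str` name.

```python
import io
import tarfile

# Hypothetical filename: the byte 0xE9 is not valid UTF-8, so decoding with
# surrogateescape maps it to the lone surrogate "\udce9".
raw_name = b"caf\xe9.txt"
name = raw_name.decode("utf-8", "surrogateescape")

# Write an in-memory pax archive containing one empty member with that name.
buf = io.BytesIO()
with tarfile.open(fileobj=buf, mode="w", format=tarfile.PAX_FORMAT,
                  encoding="utf-8", errors="surrogateescape") as tar:
    tar.addfile(tarfile.TarInfo(name))

# The name cannot be stored as UTF-8, so the pax header is expected to
# carry a "hdrcharset=BINARY" record.
has_binary_record = b"hdrcharset=BINARY" in buf.getvalue()

# Read the archive back and recover the original bytes of the filename.
buf.seek(0)
with tarfile.open(fileobj=buf, mode="r", encoding="utf-8",
                  errors="surrogateescape") as tar:
    restored = tar.getnames()[0].encode("utf-8", "surrogateescape")

print(has_binary_record, restored == raw_name)
```

    The sketch only checks the raw archive bytes and the round-tripped
    name. It makes no claim about how other tar implementations treat the
    record.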
    Additionally, tarfile works around a bug in current versions of GNU
    tar: undecodable filenames are stored as-is in a pax header without a
    "hdrcharset" record being added. Technically, these headers are
    invalid, but tarfile manages to read them correctly anyway.
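    The workaround can be tried out with a hand-built archive. The sketch
    below is an approximation under stated assumptions: it does not use GNU
    tar itself but imitates the described situation by writing a pax
    extended header whose `path` record holds raw non-UTF-8 bytes and no
    "hdrcharset" record. The helper `pax_record`, the filename, and the
    member names `./PaxHeader` and `placeholder` are hypothetical.

```python
import io
import tarfile


def pax_record(keyword: bytes, value: bytes) -> bytes:
    """Build one pax record: "<length> <keyword>=<value>\\n".

    The length counts the whole record, including its own digits.
    """
    base = len(keyword) + len(value) + 3  # space, "=", newline
    total = prev = 0
    while True:
        total = base + len(str(prev))
        if total == prev:
            break
        prev = total
    return str(total).encode("ascii") + b" " + keyword + b"=" + value + b"\n"


# Raw, undecodable filename stored as-is, without a hdrcharset record.
records = pax_record(b"path", b"caf\xe9.txt")

buf = io.BytesIO()
# USTAR_FORMAT keeps tarfile from generating pax headers of its own.
with tarfile.open(fileobj=buf, mode="w", format=tarfile.USTAR_FORMAT) as tar:
    header = tarfile.TarInfo("./PaxHeader")
    header.type = tarfile.XHDTYPE      # pax extended header block
    header.size = len(records)
    tar.addfile(header, io.BytesIO(records))
    tar.addfile(tarfile.TarInfo("placeholder"))  # member the header applies to

buf.seek(0)
with tarfile.open(fileobj=buf, mode="r", encoding="utf-8",
                  errors="surrogateescape") as tar:
    names = tar.getnames()

print([n.encode("utf-8", "surrogateescape") for n in names])
```

    Reading with `errors="surrogateescape"` is what lets the undecodable
    byte come back unchanged. With a stricter error handler the result may
    differ.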
tarfile.rst 24.9 KB