mirror of
https://github.com/python/cpython.git
synced 2025-08-04 17:08:35 +00:00
Do not put a raw REPLACEMENT CHARACTER in the document.
This commit is contained in:
parent
c5b0ec0a83
commit
c8c60c2284
1 changed files with 4 additions and 1 deletions
|
@ -263,10 +263,13 @@ Unicode result). The following examples show the differences::
|
|||
UnicodeDecodeError: 'utf8' codec can't decode byte 0x80 in position 0:
|
||||
unexpected code byte
|
||||
>>> b'\x80abc'.decode("utf-8", "replace")
|
||||
'<EFBFBD>abc'
|
||||
'?abc'
|
||||
>>> b'\x80abc'.decode("utf-8", "ignore")
|
||||
'abc'
|
||||
|
||||
(In this code example, the Unicode replacement character has been replaced by
|
||||
a question mark because it may not be displayed on some systems.)
|
||||
|
||||
Encodings are specified as strings containing the encoding's name. Python 3.2
|
||||
comes with roughly 100 different encodings; see the Python Library Reference at
|
||||
:ref:`standard-encodings` for a list. Some encodings have multiple names; for
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue