Update w.r.t. PEP 3137.

2025-09-26 18:29:57 +00:00 · 2007-10-23 18:17:00 +00:00 · 2007-10-23 18:17:00 +00:00 · 7c67613bc7
commit 7c67613bc7
parent 94c2c75b5e
1 changed files with 56 additions and 178 deletions
--- a/Doc/library/stdtypes.rst
+++ b/Doc/library/stdtypes.rst
@ -457,11 +457,11 @@ object) supplying the :meth:`__iter__` and :meth:`__next__` methods.
 .. _typesseq:
-Sequence Types --- :class:`str`, :class:`bytes`, :class:`list`, :class:`tuple`, :class:`buffer`, :class:`range`
+Sequence Types --- :class:`str`, :class:`bytes`, :class:`buffer`, :class:`list`, :class:`tuple`, :class:`range`
 ===============================================================================================================
-There are five sequence types: strings, byte sequences, lists, tuples, buffers,
+There are five sequence types: strings, byte sequences, buffers, lists, tuples,
-and range objects.  (For other containers see the built in :class:`dict`,
+and range objects.  (For other containers see the built-in :class:`dict`,
 :class:`list`, :class:`set`, and :class:`tuple` classes, and the
 :mod:`collections` module.)
@ -469,31 +469,34 @@ and range objects.  (For other containers see the built in :class:`dict`,
   object: sequence
   object: string
   object: bytes
   object: buffer
   object: tuple
   object: list
   object: buffer
   object: range
-String literals are written in single or double quotes: ``'xyzzy'``,
+Strings contain Unicode characters.  Their literals are written in single or
-``"frobozz"``.  See :ref:`strings` for more about string literals.  In addition
+double quotes: ``'xyzzy'``, ``"frobozz"``.  See :ref:`strings` for more about
-to the functionality described here, there are also string-specific methods
+string literals.  In addition to the functionality described here, there are
-described in the :ref:`string-methods` section.  Bytes objects can be
+also string-specific methods described in the :ref:`string-methods` section.
-constructed from literals too; use a ``b`` prefix with normal string syntax:
+
-``b'xyzzy'``.
+Bytes and buffer objects contain single bytes -- the former is immutable while
 the latter is a mutable sequence.  Bytes objects can be constructed from
 literals too; use a ``b`` prefix with normal string syntax: ``b'xyzzy'``.
 To construct buffer objects, use the :func:`buffer` function.
 .. warning::
   While string objects are sequences of characters (represented by strings of
-   length 1), bytes objects are sequences of *integers* (between 0 and 255),
+   length 1), bytes and buffer objects are sequences of *integers* (between 0
-   representing the ASCII value of single bytes.  That means that for a bytes
+   and 255), representing the ASCII value of single bytes.  That means that for
-   object *b*, ``b[0]`` will be an integer, while ``b[0:1]`` will be a bytes
+   a bytes or buffer object *b*, ``b[0]`` will be an integer, while ``b[0:1]``
-   object of length 1.
+   will be a bytes or buffer object of length 1.
   Also, while in previous Python versions, byte strings and Unicode strings
   could be exchanged for each other rather freely (barring encoding issues),
-   strings and bytes are completely separate concepts.  There's no implicit
+   strings and bytes are now completely separate concepts.  There's no implicit
-   en-/decoding if you pass and object of the wrong type or try to e.g. compare
+   en-/decoding if you pass and object of the wrong type.  A string always
-   a string with a bytes object.
+   compares unequal to a bytes or buffer object.
 Lists are constructed with square brackets, separating items with commas: ``[a,
 b, c]``.  Tuples are constructed by the comma operator (not within square
@ -501,10 +504,6 @@ brackets), with or without enclosing parentheses, but an empty tuple must have
 the enclosing parentheses, such as ``a, b, c`` or ``()``.  A single item tuple
 must have a trailing comma, such as ``(d,)``.
 Buffer objects are not directly supported by Python syntax, but can be created
 by calling the builtin function :func:`buffer`.  They don't support
 concatenation or repetition.
 Objects of type range are similar to buffers in that there is no specific syntax
 to create them, but they are created using the :func:`range` function.  They
 don't support slicing, concatenation or repetition, and using ``in``, ``not
@ -586,9 +585,9 @@ Notes:
      [[3], [3], [3]]
   What has happened is that ``[[]]`` is a one-element list containing an empty
-   list, so all three elements of ``[[]] * 3`` are (pointers to) this single empty
+   list, so all three elements of ``[[]] * 3`` are (pointers to) this single
-   list.  Modifying any of the elements of ``lists`` modifies this single list.
+   empty list.  Modifying any of the elements of ``lists`` modifies this single
-   You can create a list of different lists this way::
+   list.  You can create a list of different lists this way::
      >>> lists = [[] for i in range(3)]
      >>> lists[0].append(3)
@ -599,8 +598,8 @@ Notes:
 (3)
   If *i* or *j* is negative, the index is relative to the end of the string:
-   ``len(s) + i`` or ``len(s) + j`` is substituted.  But note that ``-0`` is still
+   ``len(s) + i`` or ``len(s) + j`` is substituted.  But note that ``-0`` is
-   ``0``.
+   still ``0``.
 (4)
   The slice of *s* from *i* to *j* is defined as the sequence of items with index
@ -1160,16 +1159,16 @@ Mutable Sequence Types
 .. index::
   triple: mutable; sequence; types
   object: list
-   object: bytes
+   object: buffer
-List and bytes objects support additional operations that allow in-place
+List and buffer objects support additional operations that allow in-place
 modification of the object.  Other mutable sequence types (when added to the
 language) should also support these operations.  Strings and tuples are
 immutable sequence types: such objects cannot be modified once created. The
 following operations are defined on mutable sequence types (where *x* is an
 arbitrary object).
-Note that while lists allow their items to be of any type, bytes object
+Note that while lists allow their items to be of any type, buffer object
 "items" are all integers in the range 0 <= x < 256.
 +------------------------------+--------------------------------+---------------------+
@ -1263,7 +1262,7 @@ Notes:
   sequence.
 (7)
-   :meth:`sort` is not supported by bytes objects.
+   :meth:`sort` is not supported by buffer objects.
   The :meth:`sort` method takes optional arguments for controlling the
   comparisons.
@ -1297,51 +1296,34 @@ Notes:
 .. _bytes-methods:
-Bytes Methods
+Bytes and Buffer Methods
-------------
+------------------------
 .. index:: pair: bytes; methods
           pair: buffer; methods
-In addition to the operations on mutable sequence types (see
+Bytes and buffer objects, being "strings of bytes", have all methods found on
-:ref:`typesseq-mutable`), bytes objects, being "mutable ASCII strings" have
+strings, with the exception of :func:`encode`, :func:`format` and
-further useful methods also found on strings.
+:func:`isidentifier`, which do not make sense with these types.  Wherever one of
 these methods needs to interpret the bytes as characters (e.g. the :func:`is...`
 methods), the ASCII character set is assumed.
-.. XXX "count" is documented as a mutable sequence method differently above
+.. note::
 .. XXX perhaps just split bytes and list methods
-.. method:: bytes.count(sub[, start[, end]])
+   The methods on bytes and buffer objects don't accept strings as their
   arguments, just as the methods on strings don't accept bytes as their
   arguments.  For example, you have to write ::
-   In contrast to the standard sequence ``count`` method, this returns the
+      a = "abc"
-   number of occurrences of substring (not item) *sub* in the slice
+      b = a.replace("a", "f")
-   ``[start:end]``.  Optional arguments *start* and *end* are interpreted as in
+
-   slice notation.
+   and ::
      a = b"abc"
      b = a.replace(b"a", b"f")
-.. method:: bytes.decode([encoding[, errors]])
+The bytes and buffer types have an additional class method:
   Decode the bytes using the codec registered for *encoding*. *encoding*
   defaults to the default string encoding.  *errors* may be given to set a
   different error handling scheme.  The default is ``'strict'``, meaning that
   encoding errors raise :exc:`UnicodeError`.  Other possible values are
   ``'ignore'``, ``'replace'`` and any other name registered via
   :func:`codecs.register_error`, see section :ref:`codec-base-classes`.
 .. method:: bytes.endswith(suffix[, start[, end]])
   Return ``True`` if the bytes object ends with the specified *suffix*,
   otherwise return ``False``.  *suffix* can also be a tuple of suffixes to look
   for.  With optional *start*, test beginning at that position.  With optional
   *end*, stop comparing at that position.
 .. method:: bytes.find(sub[, start[, end]])
   Return the lowest index in the string where substring *sub* is found, such that
   *sub* is contained in the range [*start*, *end*].  Optional arguments *start*
   and *end* are interpreted as in slice notation.  Return ``-1`` if *sub* is not
   found.
 .. method:: bytes.fromhex(string)
@ -1354,113 +1336,9 @@ further useful methods also found on strings.
      >>> bytes.fromhex('f0 f1f2  ')
      b'\xf0\xf1\xf2'
 .. XXX verify/document translate() semantics!
-.. method:: bytes.index(sub[, start[, end]])
+   .. method:: bytes.translate(table[, delete])
   Like :meth:`find`, but raise :exc:`ValueError` when the substring is not found.
 .. method:: bytes.join(seq)
   Return a bytes object which is the concatenation of the bytes objects in the
   sequence *seq*.  The separator between elements is the bytes object providing
   this method.
 .. method:: bytes.lstrip(which)
   Return a copy of the bytes object with leading bytes removed.  The *which*
   argument is a bytes object specifying the set of bytes to be removed.  As
   with :meth:`str.lstrip`, the *which* argument is not a prefix; rather, all
   combinations of its values are stripped.
 .. method:: bytes.partition(sep)
   Split the bytes object at the first occurrence of *sep*, and return a 3-tuple
   containing the part before the separator, the separator itself, and the part
   after the separator.  If the separator is not found, return a 3-tuple
   containing the bytes object itself, followed by two empty strings.
 .. method:: bytes.replace(old, new[, count])
   Return a copy of the bytes object with all occurrences of substring *old*
   replaced by *new*.  If the optional argument *count* is given, only the first
   *count* occurrences are replaced.
 .. method:: bytes.rfind(sub[, start[, end]])
   Return the highest index in the string where substring *sub* is found, such
   that *sub* is contained within the slice ``[start:end]``.  Optional arguments
   *start* and *end* are interpreted as in slice notation.  Return ``-1`` on
   failure.
 .. method:: bytes.rindex(sub[, start[, end]])
   Like :meth:`rfind` but raises :exc:`ValueError` when the substring *sub* is
   not found.
 .. method:: bytes.rpartition(sep)
   Split the bytes object at the last occurrence of *sep*, and return a 3-tuple
   containing the part before the separator, the separator itself, and the part
   after the separator.  If the separator is not found, return a 3-tuple
   containing two empty strings, followed by the string itself.
 .. method:: bytes.rsplit(sep[, maxsplit])
   Return a list of substrings, using *sep* as the delimiter.  If *maxsplit* is
   given, at most *maxsplit* splits are done, the *rightmost* ones.  Except for
   splitting from the right, :meth:`rsplit` behaves like :meth:`split` which is
   described in detail below.
 .. method:: bytes.rstrip(which)
   Return a copy of the bytes object with trailing bytes removed.  The *which*
   argument is a bytes object specifying the set of bytes to be removed.  As
   with :meth:`str.rstrip`, The *chars* argument is not a suffix; rather, all
   combinations of its values are stripped.
 .. method:: bytes.split(sep[, maxsplit])
   Return a list of substrings, using *sep* as the delimiter.  If *maxsplit* is
   given, at most *maxsplit* splits are done (thus, the list will have at most
   ``maxsplit+1`` elements).  If *maxsplit* is not specified, then there is no
   limit on the number of splits (all possible splits are made).  Consecutive
   delimiters are not grouped together and are deemed to delimit empty strings
   (for example, ``b'1,,2'.split(b',')`` returns ``[b'1', b'', b'2']``).  The
   *sep* argument may consist of multiple bytes (for example, ``b'1, 2,
   3'.split(b', ')`` returns ``[b'1', b'2', b'3']``).  Splitting an empty string
   with a specified separator returns ``[b'']``.
 .. method:: bytes.startswith(prefix[, start[, end]])
   Return ``True`` if the bytes object starts with the *prefix*, otherwise
   return ``False``.  *prefix* can also be a tuple of prefixes to look for.
   With optional *start*, test string beginning at that position.  With optional
   *end*, stop comparing string at that position.
 .. method:: bytes.strip(which)
   Return a copy of the bytes object with leading and trailing bytes found in
   *which* removed.  The *which* argument is a bytes object specifying the set
   of characters to be removed.  The *which* argument is not a prefix or suffix;
   rather, all combinations of its values are stripped::
      >>> b'www.example.com'.strip(b'cmowz.')
      b'example'
 .. method:: bytes.translate(table[, delete])
   Return a copy of the bytes object where all bytes occurring in the optional
   argument *delete* are removed, and the remaining bytes have been mapped