mirror of
https://github.com/python/cpython.git
synced 2025-07-23 19:25:40 +00:00
[3.13] gh-130283: update deprecated links and examples in urllib.request
docs (GH-130284) (#131626)
gh-130283: update deprecated links and examples in `urllib.request` docs (GH-130284)
(cherry picked from commit fd459b1153
)
Co-authored-by: Kanishk Pachauri <itskanishkp.py@gmail.com>
Co-authored-by: Bénédikt Tran <10796600+picnixz@users.noreply.github.com>
This commit is contained in:
parent
4af3d0e810
commit
f577a43b8d
1 changed files with 15 additions and 14 deletions
|
@ -1197,17 +1197,13 @@ In addition to the examples below, more examples are given in
|
|||
:ref:`urllib-howto`.
|
||||
|
||||
This example gets the python.org main page and displays the first 300 bytes of
|
||||
it. ::
|
||||
it::
|
||||
|
||||
>>> import urllib.request
|
||||
>>> with urllib.request.urlopen('http://www.python.org/') as f:
|
||||
... print(f.read(300))
|
||||
...
|
||||
b'<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
|
||||
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">\n\n\n<html
|
||||
xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">\n\n<head>\n
|
||||
<meta http-equiv="content-type" content="text/html; charset=utf-8" />\n
|
||||
<title>Python Programming '
|
||||
b'<!doctype html>\n<!--[if lt IE 7]> <html class="no-js ie6 lt-ie7 lt-ie8 lt-ie9"> <![endif]-->\n<!--[if IE 7]> <html class="no-js ie7 lt-ie8 lt-ie9"> <![endif]-->\n<!--[if IE 8]> <html class="no-js ie8 lt-ie9">
|
||||
|
||||
Note that urlopen returns a bytes object. This is because there is no way
|
||||
for urlopen to automatically determine the encoding of the byte stream
|
||||
|
@ -1215,21 +1211,24 @@ it receives from the HTTP server. In general, a program will decode
|
|||
the returned bytes object to string once it determines or guesses
|
||||
the appropriate encoding.
|
||||
|
||||
The following W3C document, https://www.w3.org/International/O-charset\ , lists
|
||||
the various ways in which an (X)HTML or an XML document could have specified its
|
||||
The following HTML spec document, https://html.spec.whatwg.org/#charset, lists
|
||||
the various ways in which an HTML or an XML document could have specified its
|
||||
encoding information.
|
||||
|
||||
For additional information, see the W3C document: https://www.w3.org/International/questions/qa-html-encoding-declarations.
|
||||
|
||||
As the python.org website uses *utf-8* encoding as specified in its meta tag, we
|
||||
will use the same for decoding the bytes object. ::
|
||||
will use the same for decoding the bytes object::
|
||||
|
||||
>>> with urllib.request.urlopen('http://www.python.org/') as f:
|
||||
... print(f.read(100).decode('utf-8'))
|
||||
...
|
||||
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
|
||||
"http://www.w3.org/TR/xhtml1/DTD/xhtm
|
||||
<!doctype html>
|
||||
<!--[if lt IE 7]> <html class="no-js ie6 lt-ie7 lt-ie8 lt-ie9"> <![endif]-->
|
||||
<!-
|
||||
|
||||
It is also possible to achieve the same result without using the
|
||||
:term:`context manager` approach. ::
|
||||
:term:`context manager` approach::
|
||||
|
||||
>>> import urllib.request
|
||||
>>> f = urllib.request.urlopen('http://www.python.org/')
|
||||
|
@ -1237,8 +1236,10 @@ It is also possible to achieve the same result without using the
|
|||
... print(f.read(100).decode('utf-8'))
|
||||
... finally:
|
||||
... f.close()
|
||||
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
|
||||
"http://www.w3.org/TR/xhtml1/DTD/xhtm
|
||||
...
|
||||
<!doctype html>
|
||||
<!--[if lt IE 7]> <html class="no-js ie6 lt-ie7 lt-ie8 lt-ie9"> <![endif]-->
|
||||
<!--
|
||||
|
||||
In the following example, we are sending a data-stream to the stdin of a CGI
|
||||
and reading the data it returns to us. Note that this example will only work
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue