mirror of
https://github.com/python/cpython.git
synced 2025-08-03 00:23:06 +00:00
Added more information on the differences between the htmllib and HTMLParser
modules.
This commit is contained in:
parent
5fe2c139d5
commit
25211f5724
3 changed files with 16 additions and 3 deletions
|
@ -6,7 +6,9 @@
|
|||
|
||||
This module defines a class \class{HTMLParser} which serves as the
|
||||
basis for parsing text files formatted in HTML\index{HTML} (HyperText
|
||||
Mark-up Language) and XHTML.\index{XHTML}
|
||||
Mark-up Language) and XHTML.\index{XHTML} Unlike the parser in
|
||||
\refmodule{htmllib}, this parser is not based on the SGML parser in
|
||||
\refmodule{sgmllib}.
|
||||
|
||||
|
||||
\begin{classdesc}{HTMLParser}{}
|
||||
|
@ -15,6 +17,10 @@ The \class{HTMLParser} class is instantiated without arguments.
|
|||
An HTMLParser instance is fed HTML data and calls handler functions
|
||||
when tags begin and end. The \class{HTMLParser} class is meant to be
|
||||
overridden by the user to provide a desired behavior.
|
||||
|
||||
Unlike the parser in \refmodule{htmllib}, this parser does not check
|
||||
that end tags match start tags or call the end-tag handler for
|
||||
elements which are closed implicitly by closing an outer element.
|
||||
\end{classdesc}
|
||||
|
||||
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue