Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Support
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
C
cpython
Project overview
Project overview
Details
Activity
Releases
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Issues
0
Issues
0
List
Boards
Labels
Milestones
Merge Requests
0
Merge Requests
0
Analytics
Analytics
Repository
Value Stream
Wiki
Wiki
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Create a new issue
Commits
Issue Boards
Open sidebar
Kirill Smelkov
cpython
Commits
25211f57
Commit
25211f57
authored
Jul 05, 2001
by
Fred Drake
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Added more information on the differences between the htmllib and HTMLParser
modules.
parent
5fe2c139
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
16 additions
and
3 deletions
+16
-3
Doc/lib/libhtmllib.tex
Doc/lib/libhtmllib.tex
+6
-0
Doc/lib/libhtmlparser.tex
Doc/lib/libhtmlparser.tex
+7
-1
Doc/lib/libsgmllib.tex
Doc/lib/libsgmllib.tex
+3
-2
No files found.
Doc/lib/libhtmllib.tex
View file @
25211f57
...
...
@@ -70,6 +70,12 @@ handlers for all HTML 2.0 and many HTML 3.0 and 3.2 elements.
\begin{seealso}
\seemodule
{
HTMLParser
}{
Alternate HTML parser that offers a slightly
lower-level view of the input, but is
designed to work with XHTML, and does not
implement some of the SGML syntax not used in
``HTML as deployed'' and which isn't legal
for XHTML.
}
\seemodule
{
htmlentitydefs
}{
Definition of replacement text for HTML
2.0 entities.
}
\seemodule
{
sgmllib
}{
Base class for
\class
{
HTMLParser
}
.
}
...
...
Doc/lib/libhtmlparser.tex
View file @
25211f57
...
...
@@ -6,7 +6,9 @@
This module defines a class
\class
{
HTMLParser
}
which serves as the
basis for parsing text files formatted in HTML
\index
{
HTML
}
(HyperText
Mark-up Language) and XHTML.
\index
{
XHTML
}
Mark-up Language) and XHTML.
\index
{
XHTML
}
Unlike the parser in
\refmodule
{
htmllib
}
, this parser is not based on the SGML parser in
\refmodule
{
sgmllib
}
.
\begin{classdesc}
{
HTMLParser
}{}
...
...
@@ -15,6 +17,10 @@ The \class{HTMLParser} class is instantiated without arguments.
An HTMLParser instance is fed HTML data and calls handler functions
when tags begin and end. The
\class
{
HTMLParser
}
class is meant to be
overridden by the user to provide a desired behavior.
Unlike the parser in
\refmodule
{
htmllib
}
, this parser does not check
that end tags match start tags or call the end-tag handler for
elements which are closed implicitly by closing an outer element.
\end{classdesc}
...
...
Doc/lib/libsgmllib.tex
View file @
25211f57
...
...
@@ -10,8 +10,9 @@ This module defines a class \class{SGMLParser} which serves as the
basis for parsing text files formatted in SGML (Standard Generalized
Mark-up Language). In fact, it does not provide a full SGML parser
--- it only parses SGML insofar as it is used by HTML, and the module
only exists as a base for the
\refmodule
{
htmllib
}
\refstmodindex
{
htmllib
}
module.
only exists as a base for the
\refmodule
{
htmllib
}
module. Another
HTML parser which supports XHTML and offers a somewhat different
interface is available in the
\refmodule
{
HTMLParser
}
module.
\begin{classdesc}
{
SGMLParser
}{}
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment