- 30 Mar, 2003 1 commit
-
-
Martin v. Löwis authored
-
- 14 Mar, 2003 1 commit
-
-
Fred Drake authored
This closes SF patch #669683.
-
- 02 Jun, 2002 1 commit
-
-
Raymond Hettinger authored
-
- 01 Jun, 2002 1 commit
-
-
Raymond Hettinger authored
-
- 26 Oct, 2001 1 commit
-
-
Fred Drake authored
happy. (This does not cover everything it complained about, though.)
-
- 24 Sep, 2001 1 commit
-
-
Fred Drake authored
Use a new internal method, error(), consistently to raise parse errors; the new base class also uses this. Adjust the parse_comment() method to return the new offset into the buffer instead of the number of characters scanned; this was the only helper method that did it this way, so we have better consistency now. Required to share the new base class. This fixes SF bug #448482 and #453706.
-
- 02 Aug, 2001 1 commit
-
-
Martin v. Löwis authored
-
- 19 Jul, 2001 2 commits
-
-
Fred Drake authored
-
Fred Drake authored
This closes SF patch #440153.
-
- 16 Jul, 2001 1 commit
-
-
Fred Drake authored
entity references are not allowed in that mode. Do a better job of scanning <!DOCTYPE ...> declarations; based on the code in HTMLParser.py.
-
- 14 Jul, 2001 1 commit
-
-
Fred Drake authored
this module slightly more resilient in the face of XHTML input, or just colons in attribute names.
-
- 05 Jul, 2001 1 commit
-
-
Fred Drake authored
values. The change for attribute values matches the way Mozilla and Navigator view the world, at least. This closes SF bug #436621.
-
- 21 May, 2001 1 commit
-
-
Guido van Rossum authored
basically accept <!...> where the dots can be single- or double-quoted strings or any other character except >. Background: I found a real-life example that failed to parse with the old assumption: http://www.opensource.org/licenses/jabberpl.html contains a few constructs of the form <![if !supportLists]>...<![endif]>.
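The relaxed rule described above can be sketched with a single regular expression. This is only an illustration of the stated rule, not the pattern sgmllib itself uses; the names DECL and find_declarations are hypothetical.

```python
import re

# Hypothetical pattern (an assumption, not taken from sgmllib): "<!" followed by
# any mix of double-quoted strings, single-quoted strings, or single characters
# other than ">" and the quote characters, then the closing ">".
DECL = re.compile(r"""<!(?:"[^"]*"|'[^']*'|[^>"'])*>""")

def find_declarations(text):
    """Return every <!...> construct found in text, in document order."""
    return [m.group(0) for m in DECL.finditer(text)]

sample = '<![if !supportLists]>item<![endif]> <!DOCTYPE html PUBLIC "a>b">'
print(find_declarations(sample))
```

Because quoted strings are consumed as a unit, a ">" inside quotes does not end the declaration early.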
-
- 15 Apr, 2001 1 commit
-
-
Guido van Rossum authored
found by Neal Norwitz's PyChecker.
-
- 16 Mar, 2001 1 commit
-
-
Fred Drake authored
for backward compatibility. Add support for SGML declaration syntax (<!....>) to some reasonable degree. This does not support everything allowed in SGML, but should work with "real" HTML (internal subset in a DOCTYPE is not handled). The content of the declaration is passed to the .handle_decl() method, which can be overridden by subclasses.
-
- 14 Mar, 2001 1 commit
-
-
Fred Drake authored
-
- 19 Feb, 2001 1 commit
-
-
Guido van Rossum authored
sgmllib does not recognize HTML attributes containing the semicolon ';' character. This may be in accordance with the HTML spec, but there are sites that use it (excite.com) and the browsers I regularly use (IE5, Netscape, Opera) all handle it.
Doug Fort, Downright Software LLC
-
- 15 Feb, 2001 1 commit
-
-
Skip Montanaro authored
also modified check_all function to suppress all warnings since they aren't relevant to what this test is doing (allows quiet checking of regsub, for instance)
-
- 09 Feb, 2001 2 commits
-
-
Eric S. Raymond authored
int().
-
Eric S. Raymond authored
-
- 15 Jan, 2001 1 commit
-
-
Tim Peters authored
-
- 12 Dec, 2000 1 commit
-
-
Fred Drake authored
Use != instead of <> since <> is documented as "obsolescent". Use "is" and "is not" when comparing with None or type objects.
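One reason to prefer "is" over "==" when testing for None is that "==" can be redefined by a class, while "is" always tests identity. A minimal sketch; the class AlwaysEqual is hypothetical and exists only to show the difference.

```python
class AlwaysEqual:
    """Hypothetical class whose equality test claims to match anything."""

    def __eq__(self, other):
        return True

value = AlwaysEqual()

# Equality is delegated to __eq__, so this comparison can be fooled.
equal_to_none = (value == None)

# Identity cannot be overridden; value is a distinct object from None.
is_none = (value is None)

print(equal_to_none, is_none)
```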
-
- 29 Jun, 2000 1 commit
-
-
Fred Drake authored
get_starttag_text(): New method. Return the text of the most recently parsed start tag, from the '<' to the '>' or '/'. Not really useful for structure processing, but requested for Web-related use. May also be useful for being able to re-generate the input from the parse events, but there's no equivalent for end tags. attrfind: Be a little more forgiving of unquoted attribute values.
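As a rough illustration of how such a method is used: sgmllib is not available in Python 3, so this sketch swaps in html.parser.HTMLParser, which offers a method of the same name. The subclass StartTagEcho and the helper start_tags are hypothetical, and the behaviour shown is that of HTMLParser, not necessarily identical to sgmllib's.

```python
from html.parser import HTMLParser

class StartTagEcho(HTMLParser):
    """Hypothetical subclass that records the raw text of each start tag."""

    def __init__(self):
        super().__init__()
        self.seen = []

    def handle_starttag(self, tag, attrs):
        # get_starttag_text() returns the source text of the start tag
        # that is currently being handled.
        self.seen.append(self.get_starttag_text())

def start_tags(markup):
    """Parse markup and return the raw text of every start tag seen."""
    parser = StartTagEcho()
    parser.feed(markup)
    parser.close()
    return parser.seen

print(start_tags('<a href="x.html">link</a><b>bold</b>'))
```

As the commit message notes, there is no matching call for end tags, so the input cannot be fully regenerated from parse events this way.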
-
- 28 Jun, 2000 1 commit
-
-
Jeremy Hylton authored
-
- 04 Feb, 2000 1 commit
-
-
Guido van Rossum authored
The attached patches update the standard library so that all modules have docstrings beginning with one-line summaries. A new docstring was added to formatter. The docstring for os.py was updated to mention nt, os2, ce in addition to posix, dos, mac.
-
- 25 Jan, 1999 1 commit
-
-
Fred Drake authored
of them. I.e., '<a name="foo"href="bar.html">' will now have two attributes recognized. Based on comments from newsgroup.
-
- 24 Aug, 1998 1 commit
-
-
Guido van Rossum authored
with tags that have - or . in their names.
-
- 07 Jul, 1998 1 commit
-
-
Guido van Rossum authored
parse_endtag() was restructured into parse_endtag() and finish_endtag().
-
- 28 May, 1998 1 commit
-
-
Guido van Rossum authored
- Handle <? processing instructions >.
- Allow . and - in entity names.
Also fixed an oversight in the previous fix (in one place, [ \t\r\n] was used instead of string.whitespace).
-
- 16 Apr, 1998 1 commit
-
-
Fred Drake authored
<larsga@ifi.uio.no>.
-
- 26 Mar, 1998 1 commit
-
-
Guido van Rossum authored
-
- 23 Oct, 1997 1 commit
-
-
Guido van Rossum authored
from regex to re style regular expressions. This should make sgmllib and htmllib threadsafe, so I can now create a threaded version of webchecker...
-
- 16 Dec, 1996 1 commit
-
-
Fred Drake authored
<leonard@dstc.edu.au>; allows hyphen and period in the middle of attribute names. Still not allowed as first character; as first character these are illegal in the Reference Concrete Syntax, and we've not identified any use of these characters as the first char in an attribute name in deployment on the web.
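The naming rule described above (hyphen and period allowed in the middle of an attribute name, but not as its first character) can be sketched as a regular expression. This is an assumption-based illustration, not the pattern from the patch; ATTR_NAME and is_valid_attr_name are hypothetical names, and restricting the first character to ASCII letters is also an assumption.

```python
import re

# Hypothetical pattern: an ASCII letter first, then any run of letters,
# digits, hyphens, or periods up to the end of the string.
ATTR_NAME = re.compile(r"[a-zA-Z][-.a-zA-Z0-9]*\Z")

def is_valid_attr_name(name):
    """Return True if name follows the rule sketched above."""
    return ATTR_NAME.match(name) is not None

for candidate in ("http-equiv", "data.x", "-foo", ".foo"):
    print(candidate, is_valid_attr_name(candidate))
```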
-
- 28 Mar, 1996 1 commit
-
-
Guido van Rossum authored
Allow '=' and '~' in unquoted attribute values.

Added overridable methods handle_starttag(tag, method, attrs) and handle_endtag(tag, method) so subclasses can decide whether they really want to call the method (e.g. when suppressing some portion of the document).

Added support for a number of SGML shortcuts:

    shorthand          full notation
    <tag>...<>...      <tag>...<tag>...
    <tag>...</>        <tag>...</tag>
    <tag/.../          <tag>...</tag>
    <tag1<tag2>        <tag1><tag2>
    </tag1</tag2>      </tag1></tag2>
    </tag1<tag2>       </tag1><tag2>

This required factoring out some common actions and rationalizing the interface to parse_endtag(), so as to make the code more readable.

Fixed syntax for &entity and &#char references so the trailing semicolon is optional; removed explicit support for trailing period (which was a TBL mistake in HTML 0.0).

Generalized the test program.

Tried to speed things up a little. (More to come after the profile results are in.)

Fix error recovery: call the end methods popped from the stack instead of the one that triggers. (Plus some complications because of the way HTML extensions are handled in Grail.)
-
- 06 Oct, 1995 1 commit
-
-
Guido van Rossum authored
-
- 30 Sep, 1995 1 commit
-
-
Guido van Rossum authored
-
- 22 Sep, 1995 1 commit
-
-
Guido van Rossum authored
-
- 01 Sep, 1995 1 commit
-
-
Guido van Rossum authored
-
- 10 Aug, 1995 1 commit
-
-
Guido van Rossum authored
-
- 04 Aug, 1995 1 commit
-
-
Guido van Rossum authored
-