cpython/Tools/webchecker
Guido van Rossum dc8b7980e0 Skip Montanaro:
The robotparser.py module currently lives in Tools/webchecker.  In
preparation for its migration to Lib, I made the following changes:

    * renamed the test() function _test
    * corrected the URLs in _test() so they refer to actual documents
    * added an "if __name__ == '__main__'" catcher to invoke _test()
      when run as a main program
    * added doc strings for the two main methods, parse and can_fetch
    * replaced usage of regsub and regex with corresponding re code
2000-03-27 19:29:31 +00:00
..
README Complete the integration of Sam Bayer's fixes. 1999-11-17 15:41:47 +00:00
robotparser.py Skip Montanaro: 2000-03-27 19:29:31 +00:00
tktools.py Give in to tabnanny 1998-04-06 14:29:28 +00:00
wcgui.py Changed fron importing wcnew back to webchecker. 1999-11-17 15:40:48 +00:00
wcmac.py Tiny script to play with it on a Mac. 1997-05-28 16:09:02 +00:00
webchecker.py Integrated Sam Bayer's wcnew.py code. It seems silly to keep two 1999-11-17 15:40:08 +00:00
websucker.py Changed fron importing wcnew back to webchecker. 1999-11-17 15:40:48 +00:00
wsgui.py # *NOT* by Sam Bayer: reindented to use 4 spaces like the rest here, 1999-11-17 15:13:21 +00:00

Webchecker
----------

This is a simple web tree checker, useful to find bad links in a web
tree.  It currently checks links pointing within the same subweb for
validity.  The main program is "webchecker.py".  See its doc string
(or invoke it with the option "-?") for more defails.

History:

- Jan 1997.  First release.  The module robotparser.py was written by
Skip Montanaro; the rest is original work by Guido van Rossum.

- May 1999.  Sam Bayer contributed a new version, wcnew.py, which
supports checking internal links (#spam fragments in URLs) and some
other options.

- Nov 1999.  Sam Bayer contributed patches to reintegrate wcnew.py
into webchecker.py, and corresponding mods to wcgui.py and
websucker.py.