Commit graph

671 commits

Author SHA1 Message Date
Guido van Rossum
4f6ecdaacf Add root URL entry box, separate start/stop/step buttons.
If the user selects an item in 'To check', start checking there.
1997-02-01 05:17:29 +00:00
Guido van Rossum
6133ec656e Process <img> and <frame> tags. Don't bother skipping second href. 1997-02-01 05:16:08 +00:00
Guido van Rossum
de99d310cc Check in another copy of tktools.py... 1997-01-31 18:58:53 +00:00
Guido van Rossum
06981c328d Tk interface to webchecker. Not fully featured yet, but usable. 1997-01-31 18:58:12 +00:00
Guido van Rossum
0b0b5f0279 Spin off checking of external page in a subroutine.
Increase MAXPAGE to 150K.
Add back printing of __doc__ for usage message.
1997-01-31 18:57:23 +00:00
Guido van Rossum
e5605ba3c2 Many misc changes.
- Faster HTML parser derived from SGMLparser (Fred Gansevles).

- All manipulations of todo, done, ext, bad are done via methods, so a
derived class can override.  Also moved the 'done' marking to
dopage(), so run() is much simpler.

- Added a method status() which returns a string containing the
summary counts; added a "total" count.

- Drop the guessing of the file type before opening the document -- we
still need to check those links for validity!

- Added a subroutine to close a connection which first slurps up the
remaining data when it's an ftp URL -- apparently closing an ftp
connection without reading till the end makes it hang.

- Added -n option to skip running (only useful with -R).

- The Checker object now has an instance variable which is set to 1
when it is changed.  This is not pickled.
1997-01-31 14:43:15 +00:00
Guido van Rossum
c59a5d449f Set proper User-agent header (Python-webchecker/<version>).
When -x is combined with -q, still do the checking, but don't print
the errors in this phase -- they are reported by report_errors().
1997-01-30 06:04:00 +00:00
Guido van Rossum
2739cd74b3 Some refinements of the external-link checking code: insert the errors
in the 'bad' dictionary (sanitize them so they are picklable; the
sanitization code is now a subroutine); don't check mailto: URLs; omit
colon in Error message.
1997-01-30 04:26:57 +00:00
Guido van Rossum
de66268588 Added -x option to check external links. Slooooow! 1997-01-30 03:58:21 +00:00
Guido van Rossum
325a64f207 Catch I/O errors when parsing robots.txt file.
Add version number, printed at startup in non-quiet mode.
1997-01-30 03:30:20 +00:00
Guido van Rossum
df47bafa1c Basic README file 1997-01-30 03:24:00 +00:00
Guido van Rossum
3edbb35023 Added robots.txt support, using Skip Montanaro's parser.
Fixed occasional inclusion of unpicklable objects (Message in errors).
Changed indent of a few messages.
1997-01-30 03:19:41 +00:00
Guido van Rossum
bbf8c2fafd Skip Montanaro's robots.txt parser. 1997-01-30 03:18:23 +00:00
Guido van Rossum
272b37d686 web tree checker 1997-01-30 02:44:48 +00:00
Guido van Rossum
d7e4705d8f mime types guesser 1997-01-30 02:44:20 +00:00
Guido van Rossum
04f2b453bb Version 1.1. Fix memory leak and expensive comparison with None. 1997-01-18 02:20:37 +00:00
Guido van Rossum
f06ee5fa07 /usr/local/bin/python -> /usr/bin/env python 1996-11-27 19:52:01 +00:00
Guido van Rossum
499181a833 Changed so that it replaces /usr/local/bin/python with /usr/bin/env python. 1996-11-27 19:43:49 +00:00
Guido van Rossum
d5c58c34db Changed logic so it now replaces anything that has #! and python in
the first line, replacing the entire line.
1996-11-27 19:43:01 +00:00
Guido van Rossum
fc0588241c Script to fix cprt notice 1996-11-27 19:41:55 +00:00
Barry Warsaw
3ffc503682 Removed the USA'isms that aren't really true 1996-11-20 18:43:05 +00:00
Barry Warsaw
7081a69b24 /usr/bin/env is slightly more portable 1996-11-20 15:19:10 +00:00
Barry Warsaw
00bf8efe0b Added some meta info 1996-11-20 15:17:50 +00:00
Barry Warsaw
2a6e68f693 Output original address 1996-11-19 17:59:07 +00:00
Barry Warsaw
de0e8f763b Updated country codes as per ISO3166 10-Feb-1994 1996-11-18 23:22:34 +00:00
Barry Warsaw
cb2b672015 implementation complete. need to update country codes 1996-11-18 22:57:43 +00:00
Barry Warsaw
3a7212c86a Added `ru' --> Russia 1996-11-18 22:03:36 +00:00
Barry Warsaw
f6d53448e5 A rewrite for better Python-ish style 1996-11-18 21:26:56 +00:00
Guido van Rossum
e41d00bb6a Fixed a bug: files that no longer exist remotely would never be
removed from .mirrorinfo.  Now they are (even if -r is not specified
-- the files are not removed, just their .mirrorinfo entry).

Added a feature: the -s pattern option is also used to skip local
files when removing (i.e. -r won't remove local files matching the -s
patterns).
1996-11-14 18:24:47 +00:00
Fred Drake
abcb38250b (texi2html.py): Minor improvement to HTML 3.0 output for code examples. 1996-10-25 22:13:59 +00:00
Guido van Rossum
d266eb460e New permission notice, includes CNRI. 1996-10-25 14:44:06 +00:00
Fred Drake
22f7f9e671 (texi2html.py): Added CLASS=Navigation attribute to DIV around top/bottom
navigation links for HTML 3 version.

	Forced a blank line above the footnotes separator for HTML 2; at
	least one page did not get this spaced correctly.
1996-10-11 16:54:00 +00:00
Fred Drake
0282726643 (texi2html.py): Tightened the generated HTML slightly. Added -p option to
print section titles even when the debugging output is not enabled.
	Added -3 option to generate HTML 3.0 constructs where meaningful.

	Removed repetitive garbage generation:  the old version added simple
	descriptive comments after every datadesc/funcdesc/*desc entry:

	function(args) -- function of module xxxx
	    Description....

	These comments are no longer generated:

	function(args)
	    Description....
1996-10-09 19:05:12 +00:00
Fred Drake
a39a25e5ec (texi2html.py): Use "if __name__ == '__main__':" test. Adjust indentation
to 4 spaces per level (no longer 8).

(Makefile):  Use .pyc versions of partparse.py and texi2html.py to generate
	converted documentation formats.  This reduces the startup costs;
	probably doesn't affect anyone but me in reality, but helps when
	working on the docs.
1996-09-13 14:44:34 +00:00
Guido van Rossum
be5e34204e Note obsolescence 1996-09-11 23:34:32 +00:00
Guido van Rossum
dd6798c691 Added NumVersion definition. 1996-08-26 14:38:40 +00:00
Guido van Rossum
c30461dd38 Insert filename in IOError message. 1996-08-26 14:37:47 +00:00
Guido van Rossum
96c4dd95cf OK, now it uses sys.[exec_]prefix and supports
a -o option to specify an output directory.  *Much* more convenient.
Removed a whole lot of hullabaloo from the README file, too.
1996-08-26 05:14:20 +00:00
Guido van Rossum
6498cad34e Get rid of it totally 1996-08-26 04:10:33 +00:00
Guido van Rossum
f34bf1c07b Added warning that this is unsupported 1996-08-26 03:50:10 +00:00
Guido van Rossum
aa233f18b2 Changes for 1.4 install targets 1996-08-26 03:49:49 +00:00
Guido van Rossum
6100d914e8 Allow whitespace before '#' in preprocessor directives. 1996-08-22 23:12:23 +00:00
Guido van Rossum
29901ff528 Some improvements by Fred Drake. 1996-08-09 21:46:34 +00:00
Guido van Rossum
8e7a54f525 More changes by Sjoerd & Jack 1996-07-21 02:50:30 +00:00
Guido van Rossum
e47d5f9c52 Addition by Jack to test for missing type support 1996-07-21 02:49:55 +00:00
Guido van Rossum
d4cc04c672 Add test that script name ends in .py.
Cosmetic changes to usage message (refer to "make install" now).
1996-06-17 17:49:13 +00:00
Guido van Rossum
f75f80eb48 Get struct _frozen defn from Python.h; new variable names. 1996-06-17 17:48:30 +00:00
Guido van Rossum
814842f395 Removed most of the README text since it is all about the Mac specific
examples, which no longer live here.
1996-05-24 18:43:47 +00:00
Guido van Rossum
d544d0126d Moved customization to bgenlocations.py module. 1996-04-12 16:39:00 +00:00
Guido van Rossum
9a2c546e75 Create files as temp file and move to real location only when complete.
Added some try-except statements around listdir and mkdir operations.
1996-04-09 02:51:23 +00:00
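
Note on 9a2c546e75: the "temp file, then move" approach named in that commit message can be sketched as follows. This is a minimal illustration in present-day Python, not the code from the commit; the function name write_atomically and the ".tmp-" prefix are assumptions made up for this example.

```python
import os
import tempfile


def write_atomically(path, data):
    """Write bytes to `path` by way of a temp file (hypothetical helper).

    The temp file is created in the destination directory so the final
    rename stays on one filesystem; readers see either the old file or
    the complete new one, never a partial write.
    """
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory, prefix=".tmp-")
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(data)
        # Move into place only after the content is fully written.
        os.replace(tmp_path, path)
    except BaseException:
        # Clean up the partial temp file, then re-raise the error.
        try:
            os.unlink(tmp_path)
        except OSError:
            pass
        raise
```

An interrupted transfer then leaves at most a stray temp file behind, rather than a truncated file under the real name.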