Wed, 27 Sep 2023 17:29:51 +0100 |
Henry S. Thompson |
minor bug wrt EOF of final cdx input file
|
Wed, 27 Sep 2023 17:29:09 +0100 |
Henry S. Thompson |
replicate two extremely-corner cases of the way
|
Tue, 26 Sep 2023 18:55:43 +0100 |
Henry S. Thompson |
a bit more logging
|
Tue, 26 Sep 2023 18:55:11 +0100 |
Henry S. Thompson |
a bit more logging
|
Tue, 26 Sep 2023 17:42:57 +0100 |
Henry S. Thompson |
robotstxt and crawldiagnostics get free ride,
|
Tue, 26 Sep 2023 14:18:40 +0100 |
Henry S. Thompson |
a few more from ecclerig,
|
Tue, 26 Sep 2023 09:03:47 +0100 |
Henry S. Thompson |
refactor datestream reading,
|
Mon, 25 Sep 2023 23:53:13 +0100 |
Henry S. Thompson |
more faithful regexps and non-byte uri output
|
Fri, 22 Sep 2023 15:27:28 +0100 |
Henry S. Thompson |
one uncommitted fix from quentin
|
Tue, 19 Sep 2023 19:40:58 +0100 |
Henry Thompson |
pass in debug flag(s) to merge_date.py
|
Tue, 19 Sep 2023 19:29:41 +0100 |
Henry Thompson |
loosen must-match criterion in the both-messy case
|
Tue, 19 Sep 2023 19:28:34 +0100 |
Henry Thompson |
one more sid fix,
|
Sun, 17 Sep 2023 15:18:11 +0100 |
Henry S. Thompson |
working on sessionID problems, still
|
Thu, 14 Sep 2023 19:27:23 +0100 |
Henry Thompson |
first try
|
Wed, 13 Sep 2023 16:48:43 +0100 |
Henry S. Thompson |
switch to gzip -7 to get comparable compressed cdx block size
|