Mercurial > hg > cc > cirrus_work
graph
-
refactor to provide for buffer overflow fixTue, 31 Oct 2023 14:03:02 +0000, by Henry S. Thompson
-
bug-fix wrt 1st time,Tue, 31 Oct 2023 14:01:50 +0000, by Henry S. Thompson
-
make extra file info optionalMon, 30 Oct 2023 12:19:53 +0000, by Henry S. Thompson
-
forget parallel, just do (default 2) parallel single threadsWed, 25 Oct 2023 23:01:59 +0100, by Henry S. Thompson
-
add missing makedirWed, 25 Oct 2023 23:00:45 +0100, by Henry S. Thompson
-
now does one named segment onlyTue, 24 Oct 2023 16:59:23 +0100, by Henry S. Thompson
-
resurrect parallel fetchTue, 24 Oct 2023 16:58:44 +0100, by Henry S. Thompson
-
convert to single thread,Tue, 24 Oct 2023 14:34:58 +0100, by Henry S. Thompson
-
avoid global name conflictTue, 24 Oct 2023 14:26:36 +0100, by Henry S. Thompson
-
moved from /beegfs/common-crawl to get under .hgWed, 11 Oct 2023 12:51:06 +0100, by Henry S. Thompson
-
fix typoWed, 11 Oct 2023 12:50:29 +0100, by Henry S. Thompson
-
build cluster.idxFri, 06 Oct 2023 15:06:53 +0100, by Henry S. Thompson
-
no longer using cmp_to_keyFri, 06 Oct 2023 15:05:55 +0100, by Henry S. Thompson
-
handle -m case, support src from cmdline mergefixWed, 04 Oct 2023 20:04:34 +0100, by Henry S. Thompson
-
new branch to save do_idx.sh from abandoned merge fixup mergefixThu, 05 Oct 2023 10:42:15 +0100, by Henry S. Thompson
-
try to get the counts right, particularly when re-mergingWed, 04 Oct 2023 18:53:55 +0100, by Henry S. Thompson
-
for use in debugging, see notes and tests 2, 17, merge testWed, 04 Oct 2023 18:51:56 +0100, by Henry S. Thompson
-
add various www deletion casesTue, 03 Oct 2023 17:45:57 +0100, by Henry S. Thompson
-
iterate WPAT fix with improved patternTue, 03 Oct 2023 17:44:59 +0100, by Henry S. Thompson
-
loosen WARC pattern to avoid failure from "mime" = "{...}" interveningTue, 03 Oct 2023 17:43:52 +0100, by Henry S. Thompson
-
refactor to enable rerun with fixup,Mon, 02 Oct 2023 18:56:50 +0100, by Henry S. Thompson
-
correct mistaken futnsz test,Mon, 02 Oct 2023 18:55:48 +0100, by Henry S. Thompson
-
change path to merge_date.pyMon, 02 Oct 2023 18:54:10 +0100, by Henry S. Thompson
-
remove the mistaken deletion of NONPRINT,Mon, 02 Oct 2023 18:52:43 +0100, by Henry S. Thompson
-
fix a bad fix and a bad test for the televida caseSat, 30 Sep 2023 18:04:15 +0100, by Henry S. Thompson
-
fix and test for all-decimal hostSat, 30 Sep 2023 14:13:19 +0100, by Henry S. Thompson
-
no import in lmh.__init__ any moreSat, 30 Sep 2023 14:12:39 +0100, by Henry S. Thompson
-
importing in __init__ causes problemsSat, 30 Sep 2023 14:11:49 +0100, by Henry S. Thompson
-
commented out duplicate, handle comments betterFri, 29 Sep 2023 15:59:34 +0100, by Henry S. Thompson
-
more corner case testsFri, 29 Sep 2023 15:14:29 +0100, by Henry S. Thompson
-
tweaks to get all tests through #14Fri, 29 Sep 2023 15:13:51 +0100, by Henry S. Thompson
-
get 7f (two cases) and %25 workingThu, 28 Sep 2023 18:31:23 +0100, by Henry S. Thompson
-
add televida case testThu, 28 Sep 2023 18:30:48 +0100, by Henry S. Thompson
-
add test descriptionThu, 28 Sep 2023 16:36:15 +0100, by Henry S. Thompson
-
importable just in caseThu, 28 Sep 2023 16:35:39 +0100, by Henry S. Thompson
-
move most of the hacking into fixGoogleCanon,Thu, 28 Sep 2023 16:34:49 +0100, by Henry S. Thompson
-
forget assert, allow multiple failuresThu, 28 Sep 2023 16:10:05 +0100, by Henry S. Thompson
-
xThu, 28 Sep 2023 16:09:38 +0100, by Henry S. Thompson
-
found right place for \x7f hack, maybeThu, 28 Sep 2023 14:08:36 +0100, by Henry S. Thompson
-
readabilityThu, 28 Sep 2023 14:06:11 +0100, by Henry S. Thompson
-
xThu, 28 Sep 2023 11:00:36 +0100, by Henry S. Thompson
-
refactor to sort a module in an lmh packageThu, 28 Sep 2023 11:00:24 +0100, by Henry S. Thompson
-
start some regression testsThu, 28 Sep 2023 10:54:12 +0100, by Henry S. Thompson
-
creating lmh packageThu, 28 Sep 2023 09:01:18 +0100, by Henry S. Thompson
-
moved from binThu, 28 Sep 2023 08:46:01 +0100, by Henry S. Thompson
-
minor bug wrt EOF of final cdx input fileWed, 27 Sep 2023 17:29:51 +0100, by Henry S. Thompson
-
replicate two extremely-corner cases of the wayWed, 27 Sep 2023 17:29:09 +0100, by Henry S. Thompson
-
a bit more loggingTue, 26 Sep 2023 18:55:43 +0100, by Henry S. Thompson
-
a bit more loggingTue, 26 Sep 2023 18:55:11 +0100, by Henry S. Thompson
-
robotstxt and crawldiagnostics get free ride,Tue, 26 Sep 2023 17:42:57 +0100, by Henry S. Thompson
-
a few more from ecclerig,Tue, 26 Sep 2023 14:18:40 +0100, by Henry S. Thompson
-
refactor datestream reading,Tue, 26 Sep 2023 09:03:47 +0100, by Henry S. Thompson
-
more faithful regexps and non-byte uri outputMon, 25 Sep 2023 23:53:13 +0100, by Henry S. Thompson
-
one uncommited fix from quentinFri, 22 Sep 2023 15:27:28 +0100, by Henry S. Thompson
-
pass in debug flag(s) to merge_date.pyTue, 19 Sep 2023 19:40:58 +0100, by Henry Thompson
-
loosen must-match criterion in the both-messy caseTue, 19 Sep 2023 19:29:41 +0100, by Henry Thompson
-
one more sid fix,Tue, 19 Sep 2023 19:28:34 +0100, by Henry Thompson
-
working on sessionID pblms, stillSun, 17 Sep 2023 15:18:11 +0100, by Henry S. Thompson
-
first tryThu, 14 Sep 2023 19:27:23 +0100, by Henry Thompson
-
switch to gzip -7 to get comparable compressed cdx block sizeWed, 13 Sep 2023 16:48:43 +0100, by Henry S. Thompson