Mercurial > hg > cc > cirrus_work
graph
-
moved from /beegfs/common-crawl to get under .hgWed, 11 Oct 2023 12:51:06 +0100, by Henry S. Thompson
-
fix typoWed, 11 Oct 2023 12:50:29 +0100, by Henry S. Thompson
-
build cluster.idxFri, 06 Oct 2023 15:06:53 +0100, by Henry S. Thompson
-
no longer using cmp_to_keyFri, 06 Oct 2023 15:05:55 +0100, by Henry S. Thompson
-
handle -m case, support src from cmdline mergefixWed, 04 Oct 2023 20:04:34 +0100, by Henry S. Thompson
-
new branch to save do_idx.sh from abandoned merge fixup mergefixThu, 05 Oct 2023 10:42:15 +0100, by Henry S. Thompson
-
try to get the counts right, particularly when re-mergingWed, 04 Oct 2023 18:53:55 +0100, by Henry S. Thompson
-
for use in debugging, see notes and tests 2, 17, merge testWed, 04 Oct 2023 18:51:56 +0100, by Henry S. Thompson
-
add various www deletion casesTue, 03 Oct 2023 17:45:57 +0100, by Henry S. Thompson
-
iterate WPAT fix with improved patternTue, 03 Oct 2023 17:44:59 +0100, by Henry S. Thompson
-
loosen WARC pattern to avoid failure from "mime" = "{...}" interveningTue, 03 Oct 2023 17:43:52 +0100, by Henry S. Thompson
-
refactor to enable rerun with fixup,Mon, 02 Oct 2023 18:56:50 +0100, by Henry S. Thompson
-
correct mistaken futnsz test,Mon, 02 Oct 2023 18:55:48 +0100, by Henry S. Thompson
-
change path to merge_date.pyMon, 02 Oct 2023 18:54:10 +0100, by Henry S. Thompson