Mercurial > hg > cc > cirrus_work
-
refactor to enable rerun with fixup, Mon, 02 Oct 2023 18:56:50 +0100, by Henry S. Thompson
-
correct mistaken futnsz test, Mon, 02 Oct 2023 18:55:48 +0100, by Henry S. Thompson
-
change path to merge_date.py, Mon, 02 Oct 2023 18:54:10 +0100, by Henry S. Thompson
-
remove the mistaken deletion of NONPRINT, Mon, 02 Oct 2023 18:52:43 +0100, by Henry S. Thompson
-
fix a bad fix and a bad test for the televida case, Sat, 30 Sep 2023 18:04:15 +0100, by Henry S. Thompson
-
fix and test for all-decimal host, Sat, 30 Sep 2023 14:13:19 +0100, by Henry S. Thompson
-
no import in lmh.__init__ any more, Sat, 30 Sep 2023 14:12:39 +0100, by Henry S. Thompson
-
importing in __init__ causes problems, Sat, 30 Sep 2023 14:11:49 +0100, by Henry S. Thompson
-
commented out duplicate, handle comments better, Fri, 29 Sep 2023 15:59:34 +0100, by Henry S. Thompson
-
more corner case tests, Fri, 29 Sep 2023 15:14:29 +0100, by Henry S. Thompson
-
tweaks to get all tests through #14, Fri, 29 Sep 2023 15:13:51 +0100, by Henry S. Thompson
-
get 7f (two cases) and %25 working, Thu, 28 Sep 2023 18:31:23 +0100, by Henry S. Thompson
-
add televida case test, Thu, 28 Sep 2023 18:30:48 +0100, by Henry S. Thompson
-
add test description, Thu, 28 Sep 2023 16:36:15 +0100, by Henry S. Thompson
-
importable just in case, Thu, 28 Sep 2023 16:35:39 +0100, by Henry S. Thompson
-
move most of the hacking into fixGoogleCanon, Thu, 28 Sep 2023 16:34:49 +0100, by Henry S. Thompson
-
forget assert, allow multiple failures, Thu, 28 Sep 2023 16:10:05 +0100, by Henry S. Thompson
-
x, Thu, 28 Sep 2023 16:09:38 +0100, by Henry S. Thompson
-
found right place for \x7f hack, maybe, Thu, 28 Sep 2023 14:08:36 +0100, by Henry S. Thompson
-
readability, Thu, 28 Sep 2023 14:06:11 +0100, by Henry S. Thompson
-
x, Thu, 28 Sep 2023 11:00:36 +0100, by Henry S. Thompson
-
refactor to sort a module in an lmh package, Thu, 28 Sep 2023 11:00:24 +0100, by Henry S. Thompson
-
start some regression tests, Thu, 28 Sep 2023 10:54:12 +0100, by Henry S. Thompson
-
creating lmh package, Thu, 28 Sep 2023 09:01:18 +0100, by Henry S. Thompson
-
moved from bin, Thu, 28 Sep 2023 08:46:01 +0100, by Henry S. Thompson
-
minor bug wrt EOF of final cdx input file, Wed, 27 Sep 2023 17:29:51 +0100, by Henry S. Thompson
-
replicate two extremely-corner cases of the way, Wed, 27 Sep 2023 17:29:09 +0100, by Henry S. Thompson
-
a bit more logging, Tue, 26 Sep 2023 18:55:43 +0100, by Henry S. Thompson
-
a bit more logging, Tue, 26 Sep 2023 18:55:11 +0100, by Henry S. Thompson
-
robotstxt and crawldiagnostics get free ride, Tue, 26 Sep 2023 17:42:57 +0100, by Henry S. Thompson
-
a few more from ecclerig, Tue, 26 Sep 2023 14:18:40 +0100, by Henry S. Thompson
-
refactor datestream reading, Tue, 26 Sep 2023 09:03:47 +0100, by Henry S. Thompson
-
more faithful regexps and non-byte uri output, Mon, 25 Sep 2023 23:53:13 +0100, by Henry S. Thompson
-
one uncommited fix from quentin, Fri, 22 Sep 2023 15:27:28 +0100, by Henry S. Thompson
-
pass in debug flag(s) to merge_date.py, Tue, 19 Sep 2023 19:40:58 +0100, by Henry Thompson
-
loosen must-match criterion in the both-messy case, Tue, 19 Sep 2023 19:29:41 +0100, by Henry Thompson
-
one more sid fix, Tue, 19 Sep 2023 19:28:34 +0100, by Henry Thompson
-
working on sessionID pblms, still, Sun, 17 Sep 2023 15:18:11 +0100, by Henry S. Thompson
-
first try, Thu, 14 Sep 2023 19:27:23 +0100, by Henry Thompson
-
switch to gzip -7 to get comparable compressed cdx block size, Wed, 13 Sep 2023 16:48:43 +0100, by Henry S. Thompson
-
use my own Canonicalizer to fix more obscure, Wed, 13 Sep 2023 12:41:55 +0100, by Henry S. Thompson
-
re-instate logging splits for .idx, Wed, 13 Sep 2023 12:40:39 +0100, by Henry S. Thompson
-
reinstate better check to start queuing, Tue, 12 Sep 2023 12:14:04 +0100, by Henry S. Thompson
-
bug4 fixed, but that created a new, earlier bug, Mon, 11 Sep 2023 22:06:45 +0100, by Henry S. Thompson
-
rework handling of session key problem, Mon, 11 Sep 2023 12:56:47 +0100, by Henry S. Thompson
-
initialise paths for csing, Fri, 08 Sep 2023 21:40:52 +0100, by Henry S. Thompson
-
d'oh, Fri, 08 Sep 2023 21:40:06 +0100, by Henry S. Thompson
-
include full URI in output, Fri, 08 Sep 2023 18:06:54 +0100, by Henry S. Thompson
-
try to do csing correctly on compute nodes, Fri, 08 Sep 2023 18:05:57 +0100, by Henry S. Thompson
-
version which outputs more identification, Fri, 08 Sep 2023 09:29:25 +0100, by Henry S. Thompson
-
last version before giving up on approach based only on key and datestamp, Thu, 07 Sep 2023 18:03:55 +0100, by Henry S. Thompson
-
improve reordering, still failing on cdx-00004, Wed, 06 Sep 2023 18:51:21 +0100, by Henry S. Thompson
-
attempt at reordering if necessary, Tue, 05 Sep 2023 17:33:29 +0100, by Henry S. Thompson
-
mostly working, but need to reorder in case of cfid and friends, Tue, 05 Sep 2023 17:32:46 +0100, by Henry S. Thompson
-
flip loops, Thu, 31 Aug 2023 14:14:21 +0100, by Henry S. Thompson
-
merge a stream of ks files with a set of cdx files, Wed, 30 Aug 2023 21:49:43 +0100, by Henry S. Thompson
-
final keystroke fixes, recurse and decimal www stripping, Wed, 30 Aug 2023 11:11:31 +0100, by Henry S. Thompson
-
final keystroke fixes, Wed, 30 Aug 2023 11:10:54 +0100, by Henry S. Thompson
-
handle double .www, more keep-me chars, Mon, 28 Aug 2023 21:07:43 +0100, by Henry S. Thompson
-
work-around for weird handling of %-encoding in Java impl. of SURT, Thu, 24 Aug 2023 18:21:41 +0100, by Henry S. Thompson
-
merge, including pointless fix wrt pq, Mon, 21 Aug 2023 13:06:20 -0400, by Henry Thompson
-
use surt instead of trying to create index term by hand, Sat, 19 Aug 2023 16:33:23 -0400, by Henry Thompson
-
merge, Sat, 19 Aug 2023 16:02:29 -0400, by Henry Thompson
-
stale, Sat, 19 Aug 2023 15:58:38 -0400, by Henry Thompson
-
catching up by hand with markup version, Sat, 19 Aug 2023 15:53:59 -0400, by Henry Thompson
-
include timestamp, Mon, 21 Aug 2023 13:37:07 +0100, by Henry S. Thompson
-
include query, Sun, 20 Aug 2023 00:28:43 +0100, by Henry S. Thompson
-
make CC's own sorting explicit, Fri, 18 Aug 2023 18:25:54 +0100, by Henry S. Thompson
-
handle corner cases with final . and initial www..+, Thu, 10 Aug 2023 22:14:49 +0100, by Henry S. Thompson
-
handle %-encoded utf-8 as idna, Wed, 09 Aug 2023 02:01:32 +0100, by Henry S. Thompson
-
merge, Tue, 08 Aug 2023 17:48:29 +0100, by Henry S. Thompson
-
compute timestamps, key and sort lmh lines, Tue, 08 Aug 2023 17:47:27 +0100, by Henry S. Thompson
-
work with csing, Tue, 08 Aug 2023 17:46:20 +0100, by Henry S. Thompson
-
get man -k working, Tue, 08 Aug 2023 17:46:02 +0100, by Henry S. Thompson
-
for warc_lmh slurm logs, Fri, 28 Jul 2023 00:50:13 +0100, by Henry Thompson
-
for timing analysis, Wed, 26 Jul 2023 18:42:19 +0100, by Henry S. Thompson
-
add support for multiple calls to srun with a counter, Fri, 21 Jul 2023 11:37:47 +0100, by Henry S. Thompson
-
fix eof bug, expand error messages, Thu, 20 Jul 2023 10:32:55 +0100, by Henry S. Thompson
-
part 2 is now working for all types, Wed, 19 Jul 2023 13:20:46 +0100, by Henry S. Thompson
-
add a response-only test, Wed, 19 Jul 2023 13:19:58 +0100, by Henry S. Thompson
-
revert to just showing first LM, Wed, 19 Jul 2023 13:19:42 +0100, by Henry S. Thompson
-
more tests, Fri, 14 Jul 2023 17:39:14 +0100, by Henry S. Thompson
-
Test 2 works with parts=1,2,3., Fri, 14 Jul 2023 17:38:54 +0100, by Henry S. Thompson
-
whole working, Fri, 14 Jul 2023 12:08:09 +0100, by Henry S. Thompson
-
tests 1 & 2 now working, Thu, 13 Jul 2023 14:02:02 +0100, by Henry S. Thompson
-
avoid slicing buf by using memoryview to save copying, Thu, 13 Jul 2023 11:28:24 +0100, by Henry S. Thompson
-
but skip at eobp is not working (with test 2), Wed, 12 Jul 2023 19:07:56 +0100, by Henry S. Thompson
-
works with all types, part=1, Wed, 12 Jul 2023 18:48:27 +0100, by Henry S. Thompson
-
rework completely to refill as much as possible only when necessary, Mon, 10 Jul 2023 19:52:18 +0100, by Henry S. Thompson
-
finds multiples, Mon, 10 Jul 2023 18:17:35 +0100, by Henry S. Thompson
-
little steps, Fri, 07 Jul 2023 19:30:23 +0100, by Henry S. Thompson
-
made 1 mean 1, still losing after a while, Fri, 07 Jul 2023 19:04:16 +0100, by Henry S. Thompson
-
better debugging output, Fri, 07 Jul 2023 17:04:05 +0100, by Henry S. Thompson
-
working better, gets confused by 3-part response, Fri, 07 Jul 2023 17:03:52 +0100, by Henry S. Thompson
-
a bit better, Fri, 07 Jul 2023 13:39:23 +0100, by Henry S. Thompson
-
just barely working for 1, need to rethink buffering, Thu, 06 Jul 2023 14:53:28 +0100, by Henry S. Thompson
-
starting on conversion to direct-querying of buffer, Thu, 06 Jul 2023 13:27:33 +0100, by Henry S. Thompson
-
sic, Thu, 06 Jul 2023 10:19:02 +0100, by Henry S. Thompson
-
support on-board unzipping, reduce buffer size to 2MB, Wed, 05 Jul 2023 19:32:36 +0100, by Henry S. Thompson
-
make test 1 idempotent, Wed, 05 Jul 2023 19:32:02 +0100, by Henry S. Thompson
-
just count part length, Wed, 05 Jul 2023 17:51:44 +0100, by Henry S. Thompson
-
get EOF right, finally, Wed, 05 Jul 2023 17:49:24 +0100, by Henry S. Thompson
-
make warc.py a library, separate out testing, Wed, 05 Jul 2023 15:37:16 +0100, by Henry S. Thompson
-
correct comment, Wed, 05 Jul 2023 15:12:54 +0100, by Henry S. Thompson
-
add lots more debugging output, Wed, 05 Jul 2023 15:12:07 +0100, by Henry S. Thompson
-
moved from home bin, Wed, 05 Jul 2023 15:09:57 +0100, by Henry S. Thompson
-
doc pointer, Tue, 10 Jan 2023 17:49:01 +0000, by Henry S. Thompson
-
push actions in main fn, Tue, 13 Dec 2022 14:16:42 +0000, by Henry S. Thompson
-
fixed for paper, Tue, 13 Dec 2022 14:16:22 +0000, by Henry S. Thompson
-
fix N, Thu, 24 Nov 2022 12:37:17 +0000, by Henry S. Thompson
-
compute and graph confidence intervals, Wed, 23 Nov 2022 11:05:45 +0000, by Henry S. Thompson
-
generalise hist, Tue, 22 Nov 2022 19:13:25 +0000, by Henry S. Thompson
-
add sort flag to plot_x, Tue, 22 Nov 2022 11:02:51 +0000, by Henry S. Thompson
-
get multi-ranking done right, Thu, 17 Nov 2022 13:51:19 +0000, by Henry S. Thompson
-
comments and more care about rows vs. columns, Thu, 17 Nov 2022 11:27:07 +0000, by Henry S. Thompson
-
start work on ranking, Wed, 16 Nov 2022 19:52:50 +0000, by Henry S. Thompson
-
Spearman for matlab, Wed, 16 Nov 2022 17:29:55 +0000, by Henry S. Thompson
-
move all plots into functions, Wed, 16 Nov 2022 17:28:56 +0000, by Henry S. Thompson
-
a bit more, Tue, 15 Nov 2022 19:37:28 +0000, by Henry S. Thompson
-
framework for stats over results of rank correlations, Mon, 14 Nov 2022 18:52:35 +0000, by Henry S. Thompson