Mercurial > hg > cc > cirrus_home
graph
-
improved F handling/loggingSat, 09 May 2020 16:16:28 +0100, by Henry S. Thompson
-
keep separate antecedants separate, buggy?Fri, 08 May 2020 19:52:36 +0100, by Henry S. Thompson
-
track redirects, need to us full crawldiagnostics.warc.gz for "location:" and "Uri:"Thu, 07 May 2020 18:47:24 +0100, by Henry S. Thompson
-
refactor, change summary print (problem?)Thu, 07 May 2020 11:33:24 +0100, by Henry S. Thompson
-
bare framework workingWed, 06 May 2020 18:28:52 +0100, by Henry S. Thompson
-
starting on tool to assemble as complete as we have info wrt a seed URIWed, 06 May 2020 14:25:44 +0100, by Henry S. Thompson
-
use local .m2/repository for Hadoop 3.4.0Wed, 06 May 2020 14:24:42 +0100, by Henry S. Thompson
-
works for big files with Hadoop 3.4.0Wed, 06 May 2020 14:23:33 +0100, by Henry S. Thompson
-
xWed, 06 May 2020 14:22:48 +0100, by Henry S. Thompson
-
log trucationsTue, 28 Apr 2020 19:02:34 +0100, by Henry S. Thompson
-
impose some limitsTue, 28 Apr 2020 19:02:14 +0100, by Henry S. Thompson
-
xTue, 28 Apr 2020 19:01:41 +0100, by Henry S. Thompson
-
xFri, 24 Apr 2020 20:12:44 +0100, by Henry S. Thompson
-
mostly from SebastianFri, 24 Apr 2020 20:12:29 +0100, by Henry S. Thompson
-
miscFri, 24 Apr 2020 20:03:29 +0100, by Henry S. Thompson
-
miscFri, 24 Apr 2020 20:01:35 +0100, by Henry S. Thompson
-
fix from SebastianFri, 24 Apr 2020 20:01:25 +0100, by Henry S. Thompson
-
miscFri, 24 Apr 2020 19:57:16 +0100, by Henry S. Thompson
-
miscFri, 24 Apr 2020 19:55:11 +0100, by Henry S. Thompson
-
several efficiency (hofentlich) tweaksFri, 24 Apr 2020 15:20:33 +0100, by Henry S. Thompson
-
xThu, 23 Apr 2020 17:26:55 +0100, by Henry S. Thompson
-
switch for use on login server, invoke by hand with 0/1 as only cmd line argThu, 23 Apr 2020 17:25:25 +0100, by Henry S. Thompson
-
java stuffWed, 22 Apr 2020 18:42:40 +0100, by Henry S. Thompson
-
try nutch fetch for big pdfsWed, 22 Apr 2020 18:42:23 +0100, by Henry S. Thompson
-
final most general versinWed, 15 Apr 2020 18:44:18 +0100, by Henry S. Thompson
-
too big for /dev/shm, split in halfTue, 14 Apr 2020 17:52:34 +0100, by Henry S. Thompson
-
one-off to convert big extracts.tar into lots of smaller onesTue, 14 Apr 2020 16:10:22 +0100, by Henry S. Thompson
-
as used successfully for 3rd runMon, 13 Apr 2020 17:29:31 +0100, by Henry S. Thompson
-
ready to try another pass with robust diff checkingMon, 13 Apr 2020 15:24:32 +0100, by Henry S. Thompson
-
working towards more robust diff checkingMon, 13 Apr 2020 14:12:12 +0100, by Henry S. Thompson