log

age author description
Thu, 23 Apr 2020 17:26:55 +0100 Henry S. Thompson x
Thu, 23 Apr 2020 17:25:25 +0100 Henry S. Thompson switch for use on login server, invoke by hand with 0/1 as only cmd line arg
Wed, 22 Apr 2020 18:42:40 +0100 Henry S. Thompson java stuff
Wed, 22 Apr 2020 18:42:23 +0100 Henry S. Thompson try nutch fetch for big pdfs
Wed, 15 Apr 2020 18:44:18 +0100 Henry S. Thompson final most general versin
Tue, 14 Apr 2020 17:52:34 +0100 Henry S. Thompson too big for /dev/shm, split in half
Tue, 14 Apr 2020 16:10:22 +0100 Henry S. Thompson one-off to convert big extracts.tar into lots of smaller ones
Mon, 13 Apr 2020 17:29:31 +0100 Henry S. Thompson as used successfully for 3rd run
Mon, 13 Apr 2020 15:24:32 +0100 Henry S. Thompson ready to try another pass with robust diff checking
Mon, 13 Apr 2020 14:12:12 +0100 Henry S. Thompson working towards more robust diff checking
Sat, 11 Apr 2020 13:41:46 +0100 Henry S. Thompson a few tweaks after 2nd parallel run
Fri, 10 Apr 2020 18:45:30 +0100 Henry S. Thompson another few log fixes
Fri, 10 Apr 2020 18:42:08 +0100 Henry S. Thompson as running, modulo 1 log output wrong
Fri, 10 Apr 2020 18:22:48 +0100 Henry S. Thompson log more, work around more glitches
Fri, 10 Apr 2020 18:22:24 +0100 Henry S. Thompson x
Wed, 08 Apr 2020 14:11:04 +0100 Henry S. Thompson start try to work around failures
Wed, 08 Apr 2020 11:27:33 +0100 Henry S. Thompson parallelised version of reExtract.sh
Tue, 07 Apr 2020 18:00:29 +0100 Henry S. Thompson complete change of array var construction, used it for log file names too, tar update enabled, so maybe complete but w/o any parallel
Sat, 04 Apr 2020 15:31:58 +0100 Henry S. Thompson added computation of required additions to tar file, but not actually added
Fri, 03 Apr 2020 19:04:06 +0100 Henry S. Thompson refactored, not tested
Fri, 03 Apr 2020 17:35:17 +0100 Henry S. Thompson done through re-extraction, fixing tars still to come
Thu, 02 Apr 2020 19:21:21 +0100 Henry S. Thompson sketching more
Thu, 02 Apr 2020 19:14:23 +0100 Henry S. Thompson towards re-running extraction in part
Thu, 02 Apr 2020 19:13:40 +0100 Henry S. Thompson up the time limit
Thu, 02 Apr 2020 19:13:14 +0100 Henry S. Thompson clean up after ourselves
Thu, 26 Mar 2020 15:29:12 +0000 Henry S. Thompson fixed scope pblm in tar step
Thu, 26 Mar 2020 12:24:30 +0000 Henry S. Thompson sync up filenames and log names,
Thu, 26 Mar 2020 12:23:33 +0000 Henry S. Thompson pass through extract args
Tue, 24 Mar 2020 17:53:35 +0000 Henry S. Thompson towards sub-division of resulting tar files
Tue, 24 Mar 2020 17:52:52 +0000 Henry S. Thompson not relevant