Tue, 28 Apr 2020 19:02:14 +0100 |
Henry S. Thompson |
impose some limits
|
Tue, 28 Apr 2020 19:01:41 +0100 |
Henry S. Thompson |
x
|
Fri, 24 Apr 2020 20:12:44 +0100 |
Henry S. Thompson |
x
|
Fri, 24 Apr 2020 20:12:29 +0100 |
Henry S. Thompson |
mostly from Sebastian
|
Fri, 24 Apr 2020 20:03:29 +0100 |
Henry S. Thompson |
misc
|
Fri, 24 Apr 2020 20:01:35 +0100 |
Henry S. Thompson |
misc
|
Fri, 24 Apr 2020 20:01:25 +0100 |
Henry S. Thompson |
fix from Sebastian
|
Fri, 24 Apr 2020 19:57:16 +0100 |
Henry S. Thompson |
misc
|
Fri, 24 Apr 2020 19:55:11 +0100 |
Henry S. Thompson |
misc
|
Fri, 24 Apr 2020 15:20:33 +0100 |
Henry S. Thompson |
several (hopefully) efficiency tweaks
|
Thu, 23 Apr 2020 17:26:55 +0100 |
Henry S. Thompson |
x
|
Thu, 23 Apr 2020 17:25:25 +0100 |
Henry S. Thompson |
switch for use on login server, invoke by hand with 0/1 as only cmd line arg
|
Wed, 22 Apr 2020 18:42:40 +0100 |
Henry S. Thompson |
java stuff
|
Wed, 22 Apr 2020 18:42:23 +0100 |
Henry S. Thompson |
try nutch fetch for big pdfs
|
Wed, 15 Apr 2020 18:44:18 +0100 |
Henry S. Thompson |
final most general version
|
Tue, 14 Apr 2020 17:52:34 +0100 |
Henry S. Thompson |
too big for /dev/shm, split in half
|
Tue, 14 Apr 2020 16:10:22 +0100 |
Henry S. Thompson |
one-off to convert big extracts.tar into lots of smaller ones
|
Mon, 13 Apr 2020 17:29:31 +0100 |
Henry S. Thompson |
as used successfully for 3rd run
|
Mon, 13 Apr 2020 15:24:32 +0100 |
Henry S. Thompson |
ready to try another pass with robust diff checking
|
Mon, 13 Apr 2020 14:12:12 +0100 |
Henry S. Thompson |
working towards more robust diff checking
|
Sat, 11 Apr 2020 13:41:46 +0100 |
Henry S. Thompson |
a few tweaks after 2nd parallel run
|
Fri, 10 Apr 2020 18:45:30 +0100 |
Henry S. Thompson |
another few log fixes
|
Fri, 10 Apr 2020 18:42:08 +0100 |
Henry S. Thompson |
as running, modulo 1 log output wrong
|
Fri, 10 Apr 2020 18:22:48 +0100 |
Henry S. Thompson |
log more, work around more glitches
|
Fri, 10 Apr 2020 18:22:24 +0100 |
Henry S. Thompson |
x
|
Wed, 08 Apr 2020 14:11:04 +0100 |
Henry S. Thompson |
start trying to work around failures
|
Wed, 08 Apr 2020 11:27:33 +0100 |
Henry S. Thompson |
parallelised version of reExtract.sh
|
Tue, 07 Apr 2020 18:00:29 +0100 |
Henry S. Thompson |
complete change of array var construction, used it for log file names too, tar update enabled, so maybe complete but w/o any parallelism
|
Sat, 04 Apr 2020 15:31:58 +0100 |
Henry S. Thompson |
added computation of required additions to tar file, but not actually added
|
Fri, 03 Apr 2020 19:04:06 +0100 |
Henry S. Thompson |
refactored, not tested
|