Thu, 04 Jun 2020 20:44:44 +0000 |
Henry S. Thompson |
use basefile instead of transferfile, and remove cleanup: belt and braces wrt lossage of sac_schemes.py in 15% of 1000_k3,
default tip
|
Thu, 04 Jun 2020 17:58:10 +0000 |
Henry S. Thompson |
use sorted insertion into tuple list for properties
|
Thu, 04 Jun 2020 16:10:55 +0000 |
Henry S. Thompson |
don't over-count duplicate URIs in multiple properties, produce composite keys instead
|
Thu, 04 Jun 2020 12:08:29 +0000 |
Henry S. Thompson |
switch to curl->file, enable retries
|
Wed, 03 Jun 2020 22:08:01 +0000 |
Henry S. Thompson |
fix minor argument passing snafus
|
Wed, 03 Jun 2020 16:40:34 +0000 |
Henry S. Thompson |
support multiple approaches to key combination, use local files to collect results
|
Tue, 02 Jun 2020 17:35:07 +0000 |
Henry S. Thompson |
added more robust (I hope) error handling,
|
Sun, 31 May 2020 12:06:44 +0000 |
Henry S. Thompson |
trying to get my own mapper working
|
Thu, 28 May 2020 12:55:03 +0000 |
Henry S. Thompson |
refactor a bit, add support for sac with bespoke mapper
|
Thu, 28 May 2020 09:58:38 +0000 |
Henry S. Thompson |
get quoting and arg positions right
|
Thu, 28 May 2020 09:56:42 +0000 |
Henry S. Thompson |
move to right place in tree
|
Wed, 27 May 2020 20:54:34 +0000 |
Henry S. Thompson |
from lukasz git repo 2020-05-26 (see ~/src/wecu), then editted,
|
Wed, 27 May 2020 20:54:14 +0000 |
Henry S. Thompson |
remove -s 4
|
Fri, 08 Feb 2019 17:55:48 +0000 |
Henry S. Thompson |
use bindWorkerVars.sh
|
Fri, 08 Feb 2019 17:46:49 +0000 |
Henry S. Thompson |
new scripts
|
Sun, 23 Dec 2018 19:28:37 +0000 |
Henry S. Thompson |
final merge
|
Mon, 17 Dec 2018 11:29:05 +0000 |
Henry S. Thompson |
final merge
|
Sun, 16 Dec 2018 14:25:42 +0000 |
Henry S. Thompson |
final merge
|
Sat, 15 Dec 2018 10:36:52 +0000 |
Henry S. Thompson |
final merge
|
Sat, 15 Dec 2018 10:34:14 +0000 |
Henry S. Thompson |
revert cci pattern
|
Mon, 10 Dec 2018 14:51:52 +0000 |
Henry S. Thompson |
using ptimedWhich.sh, _timedWhich.py
|
Mon, 10 Dec 2018 14:43:18 +0000 |
Henry S. Thompson |
cci path hack changed for 2018.04
|
Mon, 03 Dec 2018 21:10:02 +0000 |
Henry S. Thompson |
finally got logging sorted
|
Sat, 01 Dec 2018 16:25:04 +0000 |
Henry S. Thompson |
one last hack
|
Sat, 01 Dec 2018 12:13:34 +0000 |
Henry S. Thompson |
knock off a few more relatively common cases
|
Fri, 30 Nov 2018 18:37:40 +0000 |
Henry S. Thompson |
update to use _timedWhich.py
|
Fri, 30 Nov 2018 15:41:02 +0000 |
Henry S. Thompson |
works on one file
|
Fri, 30 Nov 2018 13:44:50 +0000 |
Henry S. Thompson |
simpler fix for d-o-m default
|
Fri, 30 Nov 2018 13:43:36 +0000 |
Henry S. Thompson |
start work on python version of tW.sh
|
Thu, 29 Nov 2018 15:14:46 +0000 |
Henry S. Thompson |
try to fix a few more niggling bugs
|
Thu, 29 Nov 2018 13:52:07 +0000 |
Henry S. Thompson |
final merge
|
Thu, 29 Nov 2018 13:51:43 +0000 |
Henry S. Thompson |
parameterise name
|
Thu, 29 Nov 2018 13:41:27 +0000 |
Henry S. Thompson |
merge the uploaded results of fixAndMerge
|
Tue, 27 Nov 2018 19:08:08 +0000 |
Henry S. Thompson |
what it says on the tin
|
Wed, 21 Nov 2018 18:42:56 +0000 |
Henry S. Thompson |
fixes to logging and efficiency, see also notes.txt wrt patches to dateparser
|
Tue, 20 Nov 2018 14:49:07 +0000 |
Henry S. Thompson |
fixDates, _fixAndMerge, _doFetch
|
Tue, 20 Nov 2018 10:31:05 +0000 |
Henry S. Thompson |
rewritten to be faster, maybe, and avoid earlier bug
|
Mon, 19 Nov 2018 18:33:17 +0000 |
Henry S. Thompson |
partway to rework after failure of mergedWhich.x64700
|
Mon, 19 Nov 2018 18:32:30 +0000 |
Henry S. Thompson |
hacking to get id into wbash.sh, maybe buggy?
|
Mon, 19 Nov 2018 18:31:21 +0000 |
Henry S. Thompson |
as used for mergedWhich.x64700
|
Sat, 10 Nov 2018 13:22:21 +0000 |
Henry S. Thompson |
more stuff
|
Sat, 10 Nov 2018 13:20:56 +0000 |
Henry S. Thompson |
attempt to fix robustness pblms
|
Wed, 07 Nov 2018 19:36:30 +0000 |
Henry S. Thompson |
-mforce (?) multiple processors to be used
|
Wed, 07 Nov 2018 17:37:27 +0000 |
Henry S. Thompson |
works on which.16
|
Wed, 07 Nov 2018 14:15:56 +0000 |
Henry S. Thompson |
improved error handling,
|
Wed, 31 Oct 2018 21:42:34 +0000 |
Henry S. Thompson |
more tweaks wrt getting H4 going
|
Mon, 22 Oct 2018 10:04:12 +0000 |
Henry S. Thompson |
> for >>
|
Sun, 21 Oct 2018 12:41:10 +0000 |
Henry S. Thompson |
use /var/data, don't store but include subproc _inside_ tryread
|
Sat, 20 Oct 2018 16:13:58 +0000 |
Henry S. Thompson |
try to use dateparser.parse on non-standard last-modified
|
Sat, 20 Oct 2018 16:11:29 +0000 |
Henry S. Thompson |
lots of tweaking, reached the 80/20 point
|
Fri, 19 Oct 2018 14:25:19 +0000 |
Henry S. Thompson |
F2-related stuff, and new experiment
|
Fri, 19 Oct 2018 11:36:31 +0000 |
Henry S. Thompson |
first cut at http/https real trial, with month and year last-modified info too
|
Fri, 19 Oct 2018 11:34:26 +0000 |
Henry S. Thompson |
fix (1st, but not only?) fail when using after vmss restart
|
Thu, 18 Oct 2018 17:01:34 +0000 |
Henry S. Thompson |
make a local bin directory for each worker
|
Thu, 18 Oct 2018 14:30:04 +0000 |
Henry S. Thompson |
trying to make this work
|
Fri, 12 Oct 2018 08:51:50 +0000 |
Henry S. Thompson |
shrinkJSON.sh: minimise "jq ." output
|
Wed, 10 Oct 2018 11:28:21 +0000 |
Henry S. Thompson |
scan WAT files for application/pdf responses
|
Wed, 10 Oct 2018 11:27:06 +0000 |
Henry S. Thompson |
a bit more info in logs
|
Mon, 08 Oct 2018 13:17:23 +0000 |
Henry S. Thompson |
wrun.sh: usage catchup
|
Tue, 02 Oct 2018 10:52:45 +0000 |
Henry S. Thompson |
wrun.sh, invoke.sh:
|