| Wed, 21 May 2025 22:05:24 +0100 |
Henry S. Thompson |
double buffer size to deal with massive header cases
trim tip
|
| Sun, 18 May 2025 13:08:04 +0100 |
Henry S. Thompson |
replace xargs with an explicit serial loop plus wait
trim
|
| Sat, 17 May 2025 11:12:46 +0100 |
Henry S. Thompson |
get rm in loop in right place, ensure unique pipe names
trim
|
| Tue, 13 May 2025 14:44:15 +0100 |
Henry S. Thompson |
working
trim
|
| Tue, 13 May 2025 13:32:26 +0100 |
Henry S. Thompson |
fixed time-stamp fixup bugs
trim
|
| Tue, 13 May 2025 12:06:01 +0100 |
Henry S. Thompson |
sic
trim
|
| Tue, 13 May 2025 12:05:22 +0100 |
Henry S. Thompson |
adapt to new configuration
trim
|
| Tue, 13 May 2025 12:04:01 +0100 |
Henry S. Thompson |
works
trim
|
| Thu, 08 May 2025 19:00:26 +0100 |
Henry S. Thompson |
just starting
trim
|
| Tue, 06 May 2025 16:52:32 +0100 |
Henry S. Thompson |
try trimming various more-or-less constant bits of the key and value
trim
|
| Mon, 05 May 2025 20:57:46 +0100 |
Henry S. Thompson |
robotstxt now working?
default
|
| Mon, 05 May 2025 20:57:30 +0100 |
Henry S. Thompson |
add another digit or two (segment #) to key for r_t
|
| Mon, 05 May 2025 20:39:16 +0100 |
Henry S. Thompson |
better font
|
| Wed, 23 Apr 2025 11:03:48 +0100 |
Henry S. Thompson |
still hacking var bindings...
|
| Tue, 22 Apr 2025 14:32:07 +0100 |
Henry S. Thompson |
job index arg to doit, slightly better diagnostic output
|
| Fri, 18 Apr 2025 13:39:55 +0100 |
Henry S. Thompson |
extend, then fix, to get it working for crawldiagnostics warc files
|
| Wed, 09 Apr 2025 20:42:29 +0100 |
Henry S. Thompson |
fix another long-tail bug
|
| Wed, 09 Apr 2025 17:15:40 +0100 |
Henry S. Thompson |
accommodate to change to digits for record type,
|
| Wed, 09 Apr 2025 12:57:50 +0100 |
Henry S. Thompson |
simple refill working?
|
| Wed, 09 Apr 2025 11:15:14 +0100 |
Henry S. Thompson |
try simpler refill
|
| Tue, 08 Apr 2025 16:06:33 +0100 |
Henry S. Thompson |
park that, try fixed large buffer and large-enough min to ensure we always have a whole record in view
|
| Mon, 07 Apr 2025 16:34:31 +0100 |
Henry S. Thompson |
in the midst of trying to rethink the refill logic
|
| Mon, 24 Mar 2025 14:30:32 +0000 |
Henry S. Thompson |
trying to recover from partial, not-ordered, run of segs 0--7
|
| Sat, 08 Mar 2025 22:31:14 +0000 |
Henry S. Thompson |
fix GMT fix,
|
| Fri, 07 Mar 2025 21:17:47 +0000 |
Henry S. Thompson |
try to do the whole thing in one go
|
| Fri, 07 Mar 2025 18:15:41 +0000 |
Henry S. Thompson |
type decls, cythonize works
|
| Fri, 07 Mar 2025 15:39:36 +0000 |
Henry S. Thompson |
type decls, cythonize works
|
| Wed, 05 Mar 2025 23:29:25 +0000 |
Henry S. Thompson |
automate a cdb chain
|
| Thu, 27 Feb 2025 18:23:31 +0000 |
Henry S. Thompson |
move final report to stderr
|
| Thu, 27 Feb 2025 18:23:05 +0000 |
Henry S. Thompson |
work with cdb logging, not sure why it was necessary
|