Thu, 02 Jan 2025 14:52:14 +0000 |
Henry S. Thompson |
try adding lm to existing index from ks_0-9
|
Thu, 02 Jan 2025 14:51:00 +0000 |
Henry S. Thompson |
output bytes, pickle and save dict if -p, trim lm value to int
|
Wed, 01 Jan 2025 23:02:35 +0000 |
Henry S. Thompson |
test big dict for associating lm timestamp with cc timestamp+uri
|
Thu, 03 Oct 2024 18:17:55 +0100 |
Henry S. Thompson |
working together works well to provide what's needed to update a cdx to include lastmod where possible
|
Wed, 02 Oct 2024 19:54:45 +0100 |
Henry S. Thompson |
make into a library, entry point def unpackz(infileName, callback, outfile = None),
|
Wed, 02 Oct 2024 11:09:58 +0100 |
Henry S. Thompson |
cleaned up indentation to 2 spaces throughout
|
Wed, 02 Oct 2024 09:56:37 +0100 |
Henry S. Thompson |
take bufsize from cmdline
|
Tue, 01 Oct 2024 15:59:26 +0100 |
Henry S. Thompson |
eof pblms fixed, seems to work
|
Sat, 28 Sep 2024 15:19:05 +0100 |
Henry S. Thompson |
working, but last count/offset not being written
|
Thu, 26 Sep 2024 17:54:12 +0100 |
Henry S. Thompson |
fix error message
|
Thu, 26 Sep 2024 12:38:34 +0100 |
Henry S. Thompson |
csing disabled for now
|
Thu, 26 Sep 2024 12:29:27 +0100 |
Henry S. Thompson |
font hacking, see also lib/xemacs/common-init.el
|
Thu, 26 Sep 2024 12:25:54 +0100 |
Henry S. Thompson |
new default from CC themselves
|
Thu, 26 Sep 2024 12:24:16 +0100 |
Henry S. Thompson |
for debugging?
|
Thu, 09 May 2024 12:36:57 +0100 |
Henry S. Thompson |
for use in Stuttgart, maybe
|
Sat, 02 Mar 2024 10:59:06 +0000 |
Henry S. Thompson |
xxx
|
Thu, 29 Feb 2024 15:01:10 +0000 |
Henry S. Thompson |
merge
|
Thu, 29 Feb 2024 15:01:02 +0000 |
Henry S. Thompson |
post-processing
|
Wed, 28 Feb 2024 18:31:52 +0000 |
Henry S. Thompson |
sic
|
Thu, 29 Feb 2024 14:59:50 +0000 |
Henry S. Thompson |
compute offset between LM and crawl timestamp
|
Thu, 29 Feb 2024 14:59:09 +0000 |
Henry S. Thompson |
sic
|
Wed, 28 Feb 2024 15:27:00 +0000 |
Henry S. Thompson |
rebuild to match triple fig line colour
|
Wed, 28 Feb 2024 15:13:38 +0000 |
Henry S. Thompson |
rebuild with more consistent appearance
|
Wed, 28 Feb 2024 14:50:08 +0000 |
Henry S. Thompson |
merge
|
Wed, 28 Feb 2024 14:49:45 +0000 |
Henry S. Thompson |
replaced mean_lens by w or wo bogon
|
Wed, 28 Feb 2024 14:44:59 +0000 |
Henry S. Thompson |
now using clean 2005 count
|
Wed, 28 Feb 2024 10:32:01 +0000 |
Henry Thompson |
minor addition?
|
Wed, 28 Feb 2024 10:20:44 +0000 |
Henry S. Thompson |
merge
|
Wed, 28 Feb 2024 10:15:56 +0000 |
Henry S. Thompson |
what is this?
|
Tue, 20 Feb 2024 15:23:47 +0000 |
Henry Thompson |
add percentage of non-latin by crawl table
|