log

age author description
4 weeks ago Henry S. Thompson use cdb library directly,
4 weeks ago Henry S. Thompson use cdb library directly
5 weeks ago Henry S. Thompson works with big (ks_0-9.60.cdb) cdb file
5 weeks ago Henry S. Thompson finally get test code separated from db.pyx to work
5 weeks ago Henry S. Thompson cython header file for db.pyx
5 weeks ago Henry S. Thompson remove the testing code, leaving just the class
5 weeks ago Henry S. Thompson prepare a ks..tsv file for indexing into a cdb
5 weeks ago Henry S. Thompson renamed cpython class Cdb to CCdb to avoid name conflict with cdb.Cdb
5 weeks ago Henry S. Thompson work with libcdb.a
6 weeks ago Henry S. Thompson value from memory view working
6 weeks ago Henry S. Thompson try using cdb as C library
6 weeks ago Henry S. Thompson add some cython decoration, not much effect
6 weeks ago Henry S. Thompson run with login shell
6 weeks ago Henry S. Thompson tweak XEmacs font/key bindings
6 weeks ago Henry S. Thompson tweak XEmacs font
2 months ago Henry S. Thompson time the unpickling
2 months ago Henry S. Thompson with bloom prefilter
2 months ago Henry S. Thompson try adding lm to existing index from ks_0-9
2 months ago Henry S. Thompson output bytes, pickle and save dict if -p, trim lm value to int
2 months ago Henry S. Thompson test big dict for associating lm timestamp with cc timestamp+uri
5 months ago Henry S. Thompson working together works well to provide what's needed to update a cdx to include lastmod where possible
5 months ago Henry S. Thompson make into a library, entry point def unpackz(infileName, callback, outfile = None),
5 months ago Henry S. Thompson cleaned up indentation to 2 spaces throughout
5 months ago Henry S. Thompson take bufsize from cmdline
5 months ago Henry S. Thompson eof pblms fixed, seems to work
5 months ago Henry S. Thompson working, but last count/offset not being written
5 months ago Henry S. Thompson fix error message
5 months ago Henry S. Thompson csing disabled for now
5 months ago Henry S. Thompson font hacking, see also lib/xemacs/common-init.el
5 months ago Henry S. Thompson new default from CC themselves
5 months ago Henry S. Thompson for debugging?
10 months ago Henry S. Thompson for use in Stuttgart, maybe
12 months ago Henry S. Thompson xxx
12 months ago Henry S. Thompson merge
12 months ago Henry S. Thompson post-processing
12 months ago Henry S. Thompson sic
12 months ago Henry S. Thompson compute offset between LM and crawl timestamp
12 months ago Henry S. Thompson sic
12 months ago Henry S. Thompson rebuild to match triple fig line colour
12 months ago Henry S. Thompson rebuild with more consistent appearance
12 months ago Henry S. Thompson merge
12 months ago Henry S. Thompson replaced mean_lens by w or wo bogon
12 months ago Henry S. Thompson now using clean 2005 count
12 months ago Henry Thompson minor addition?
12 months ago Henry S. Thompson merge
12 months ago Henry S. Thompson what is this?
12 months ago Henry Thompson add percentage of non-latin by crawl table
12 months ago Henry Thompson tld change investigation
12 months ago Henry S. Thompson nl1 and tld summary results
12 months ago Henry S. Thompson correct Usage
12 months ago Henry S. Thompson csing-related tweaks
12 months ago Henry S. Thompson merge
12 months ago Henry S. Thompson see Paul:Documents/HTalks/WebSci2024
13 months ago Henry S. Thompson add some debugging info
13 months ago Henry S. Thompson use 2-digit suffixes,
15 months ago Henry S. Thompson sic
15 months ago Henry S. Thompson sic
15 months ago Henry S. Thompson added back missing years
15 months ago Henry S. Thompson support semilogy from cmd line
15 months ago Henry S. Thompson means of all columns in length analyses
15 months ago Henry S. Thompson normalise % counts by non-empty bases only
15 months ago Henry S. Thompson new plots various
15 months ago Henry S. Thompson get single graph working, tweak params various
15 months ago Henry S. Thompson compute (component) uri lengths and a few other properties