log

age author description
17 months ago Henry S. Thompson refactor datestream reading,
17 months ago Henry S. Thompson more faithful regexps and non-byte uri output
17 months ago Henry S. Thompson one uncommited fix from quentin
17 months ago Henry Thompson pass in debug flag(s) to merge_date.py
17 months ago Henry Thompson loosen must-match criterion in the both-messy case
17 months ago Henry Thompson one more sid fix,
17 months ago Henry S. Thompson working on sessionID pblms, still
17 months ago Henry Thompson first try
17 months ago Henry S. Thompson switch to gzip -7 to get comparable compressed cdx block size
17 months ago Henry S. Thompson use my own Canonicalizer to fix more obscure
17 months ago Henry S. Thompson re-instate logging splits for .idx
17 months ago Henry S. Thompson reinstate better check to start queuing,
17 months ago Henry S. Thompson bug4 fixed, but that created a new, earlier bug
17 months ago Henry S. Thompson rework handling of session key problem
17 months ago Henry S. Thompson initialise paths for csing
17 months ago Henry S. Thompson d'oh
17 months ago Henry S. Thompson include full URI in output
17 months ago Henry S. Thompson try to do csing correctly on compute nodes
17 months ago Henry S. Thompson version which outputs more identification,
17 months ago Henry S. Thompson last version before giving up on approach based only on key and datestamp
17 months ago Henry S. Thompson improve reordering, still failing on cdx-00004
17 months ago Henry S. Thompson attempt at reordering if necessary
17 months ago Henry S. Thompson mostly working, but need to reorder in case of cfid and friends
18 months ago Henry S. Thompson flip loops
18 months ago Henry S. Thompson merge a stream of ks files with a set of cdx files
18 months ago Henry S. Thompson final keystroke fixes, recurse and decimal www stripping
18 months ago Henry S. Thompson final keystroke fixes,
18 months ago Henry S. Thompson handle double .www, more keep-me chars