Mercurial > hg > cc > cirrus_work
graph
- compute (component) uri lengths and a few other properties (15 months ago, by Henry S. Thompson)
- with three tracks from two years (15 months ago, by Henry S. Thompson)
- for pub (15 months ago, by Henry S. Thompson)
- tweaked formatting (15 months ago, by Henry S. Thompson)
- excel rewrote, no important changes (?) (15 months ago, by Henry S. Thompson)
- replace wrong one with right one (15 months ago, by Henry S. Thompson)
- merge (15 months ago, by Henry S. Thompson)
- implement alternative confidence measure using stats.bootstrap, (15 months ago, by Henry S. Thompson)
- for LMh percentile (15 months ago, by Henry S. Thompson)
- decorated (15 months ago, by Henry S. Thompson)
- merge (15 months ago, by Henry S. Thompson)
- can't add props to DescribeResult (15 months ago, by Henry S. Thompson)
- for 2023-40 (15 months ago, by Henry S. Thompson)
- with decorations (15 months ago, by Henry S. Thompson)
- excel rewrote, no important changes (?) (15 months ago, by Henry S. Thompson)
- with percentile instead of raw mean correl (15 months ago, by Henry S. Thompson)
- change heatmap to by percentile (15 months ago, by Henry S. Thompson)
- with heat (15 months ago, by Henry S. Thompson)
- heat map for mime vs. nl1 vs. len (15 months ago, by Henry S. Thompson)
- add head_map fn (15 months ago, by Henry S. Thompson)
- add explore_deltas and predict analysis fns (15 months ago, by Henry S. Thompson)
- rename to avoid name clash with scipy.stats (15 months ago, by Henry S. Thompson)
- move to class with local vars instead of many globals (15 months ago, by Henry S. Thompson)
- renamed to by_interval.py (15 months ago, by Henry S. Thompson)
- renamed from spearman.py (15 months ago, by Henry S. Thompson)
- renamed to stats.py (15 months ago, by Henry S. Thompson)
- do the __main__ thing (15 months ago, by Henry S. Thompson)
- put results in numbered subdirs (15 months ago, by Henry S. Thompson)
- add minimal logging and don't return until finished (15 months ago, by Henry S. Thompson)
- should work for months also now (15 months ago, by Henry S. Thompson)
- cross-language confusion :-) (15 months ago, by Henry S. Thompson)
- LM plot for multiple crawls, magnitude or %age (16 months ago, by Henry S. Thompson)
- can overlay the two (16 months ago, by Henry S. Thompson)
- fix output year (16 months ago, by Henry S. Thompson)
- sic (16 months ago, by Henry S. Thompson)
- sic (16 months ago, by Henry S. Thompson)
- get in/out file management working right (16 months ago, by Henry S. Thompson)
- refactor to provide for buffer overflow fix (16 months ago, by Henry S. Thompson)
- bug-fix wrt 1st time, (16 months ago, by Henry S. Thompson)
- make extra file info optional (16 months ago, by Henry S. Thompson)
- forget parallel, just do (default 2) parallel single threads (16 months ago, by Henry S. Thompson)
- add missing makedir (16 months ago, by Henry S. Thompson)
- now does one named segment only (16 months ago, by Henry S. Thompson)
- resurrect parallel fetch (16 months ago, by Henry S. Thompson)
- convert to single thread, (16 months ago, by Henry S. Thompson)
- avoid global name conflict (16 months ago, by Henry S. Thompson)
- moved from /beegfs/common-crawl to get under .hg (17 months ago, by Henry S. Thompson)
- fix typo (17 months ago, by Henry S. Thompson)
- build cluster.idx (17 months ago, by Henry S. Thompson)
- no longer using cmp_to_key (17 months ago, by Henry S. Thompson)
- new branch to save do_idx.sh from abandoned merge fixup mergefix (17 months ago, by Henry S. Thompson)
- try to get the counts right, particularly when re-merging (17 months ago, by Henry S. Thompson)
- for use in debugging, see notes and tests 2, 17, merge test (17 months ago, by Henry S. Thompson)
- add various www deletion cases (17 months ago, by Henry S. Thompson)
- iterate WPAT fix with improved pattern (17 months ago, by Henry S. Thompson)
- loosen WARC pattern to avoid failure from "mime" = "{...}" intervening (17 months ago, by Henry S. Thompson)
- refactor to enable rerun with fixup, (17 months ago, by Henry S. Thompson)
- correct mistaken futnsz test, (17 months ago, by Henry S. Thompson)
- change path to merge_date.py (17 months ago, by Henry S. Thompson)
- remove the mistaken deletion of NONPRINT, (17 months ago, by Henry S. Thompson)
- fix a bad fix and a bad test for the televida case (17 months ago, by Henry S. Thompson)
- fix and test for all-decimal host (17 months ago, by Henry S. Thompson)
- no import in lmh.__init__ any more (17 months ago, by Henry S. Thompson)
- importing in __init__ causes problems (17 months ago, by Henry S. Thompson)
- commented out duplicate, handle comments better (17 months ago, by Henry S. Thompson)
- more corner case tests (17 months ago, by Henry S. Thompson)
- tweaks to get all tests through #14 (17 months ago, by Henry S. Thompson)
- get 7f (two cases) and %25 working (17 months ago, by Henry S. Thompson)
- add televida case test (17 months ago, by Henry S. Thompson)
- add test description (17 months ago, by Henry S. Thompson)
- importable just in case (17 months ago, by Henry S. Thompson)
- move most of the hacking into fixGoogleCanon, (17 months ago, by Henry S. Thompson)
- forget assert, allow multiple failures (17 months ago, by Henry S. Thompson)
- x (17 months ago, by Henry S. Thompson)
- found right place for \x7f hack, maybe (17 months ago, by Henry S. Thompson)
- readability (17 months ago, by Henry S. Thompson)
- x (17 months ago, by Henry S. Thompson)
- refactor to sort a module in an lmh package (17 months ago, by Henry S. Thompson)
- start some regression tests (17 months ago, by Henry S. Thompson)
- creating lmh package (17 months ago, by Henry S. Thompson)
- moved from bin (17 months ago, by Henry S. Thompson)
- minor bug wrt EOF of final cdx input file (17 months ago, by Henry S. Thompson)
- replicate two extremely-corner cases of the way (17 months ago, by Henry S. Thompson)
- a bit more logging (17 months ago, by Henry S. Thompson)
- a bit more logging (17 months ago, by Henry S. Thompson)
- robotstxt and crawldiagnostics get free ride, (17 months ago, by Henry S. Thompson)
- a few more from ecclerig, (17 months ago, by Henry S. Thompson)
- refactor datestream reading, (17 months ago, by Henry S. Thompson)
- more faithful regexps and non-byte uri output (17 months ago, by Henry S. Thompson)
- one uncommited fix from quentin (17 months ago, by Henry S. Thompson)
- pass in debug flag(s) to merge_date.py (17 months ago, by Henry Thompson)
- loosen must-match criterion in the both-messy case (17 months ago, by Henry Thompson)
- one more sid fix, (17 months ago, by Henry Thompson)
- working on sessionID pblms, still (17 months ago, by Henry S. Thompson)
- first try (18 months ago, by Henry Thompson)
- switch to gzip -7 to get comparable compressed cdx block size (18 months ago, by Henry S. Thompson)
- use my own Canonicalizer to fix more obscure (18 months ago, by Henry S. Thompson)
- re-instate logging splits for .idx (18 months ago, by Henry S. Thompson)
- reinstate better check to start queuing, (18 months ago, by Henry S. Thompson)
- bug4 fixed, but that created a new, earlier bug (18 months ago, by Henry S. Thompson)
- rework handling of session key problem (18 months ago, by Henry S. Thompson)
- initialise paths for csing (18 months ago, by Henry S. Thompson)
- d'oh (18 months ago, by Henry S. Thompson)
- include full URI in output (18 months ago, by Henry S. Thompson)
- try to do csing correctly on compute nodes (18 months ago, by Henry S. Thompson)
- version which outputs more identification, (18 months ago, by Henry S. Thompson)
- last version before giving up on approach based only on key and datestamp (18 months ago, by Henry S. Thompson)
- improve reordering, still failing on cdx-00004 (18 months ago, by Henry S. Thompson)
- attempt at reordering if necessary (18 months ago, by Henry S. Thompson)
- mostly working, but need to reorder in case of cfid and friends (18 months ago, by Henry S. Thompson)
- flip loops (18 months ago, by Henry S. Thompson)
- merge a stream of ks files with a set of cdx files (18 months ago, by Henry S. Thompson)
- final keystroke fixes, recurse and decimal www stripping (18 months ago, by Henry S. Thompson)
- final keystroke fixes, (18 months ago, by Henry S. Thompson)
- handle double .www, more keep-me chars (18 months ago, by Henry S. Thompson)
- work-around for weird handling of %-encoding in Java impl. of SURT (18 months ago, by Henry S. Thompson)
- merge, including pointless fix wrt pq (18 months ago, by Henry Thompson)
- use surt instead of trying to create index term by hand (18 months ago, by Henry Thompson)
- merge (18 months ago, by Henry Thompson)
- stale (18 months ago, by Henry Thompson)
- catching up by hand with markup version, (18 months ago, by Henry Thompson)
- include timestamp (18 months ago, by Henry S. Thompson)
- include query (18 months ago, by Henry S. Thompson)
- make CC's own sorting explicit (18 months ago, by Henry S. Thompson)
- handle corner cases with final . and initial www..+ (19 months ago, by Henry S. Thompson)
- handle %-encoded utf-8 as idna (19 months ago, by Henry S. Thompson)
- merge (19 months ago, by Henry S. Thompson)