Mercurial > hg > cc > cirrus_work
graph
-
value from memory view workingSat, 18 Jan 2025 23:00:30 +0000, by Henry S. Thompson
-
try using cdb as C librarySat, 18 Jan 2025 21:25:17 +0000, by Henry S. Thompson
-
add some cython decoration, not much effectFri, 17 Jan 2025 20:37:10 +0000, by Henry S. Thompson
-
run with login shellFri, 17 Jan 2025 20:35:21 +0000, by Henry S. Thompson
-
tweak XEmacs font/key bindingsFri, 17 Jan 2025 20:34:32 +0000, by Henry S. Thompson
-
tweak XEmacs fontFri, 17 Jan 2025 19:58:04 +0000, by Henry S. Thompson
-
time the unpicklingThu, 02 Jan 2025 18:35:08 +0000, by Henry S. Thompson
-
with bloom prefilterThu, 02 Jan 2025 18:30:03 +0000, by Henry S. Thompson
-
try adding lm to existing index from ks_0-9Thu, 02 Jan 2025 14:52:14 +0000, by Henry S. Thompson
-
output bytes, pickle and save dict if -p, trim lm value to intThu, 02 Jan 2025 14:51:00 +0000, by Henry S. Thompson
-
test big dict for associating lm timestamp with cc timestamp+uriWed, 01 Jan 2025 23:02:35 +0000, by Henry S. Thompson
-
working together works well to provide what's needed to update a cdx to include lastmod where possibleThu, 03 Oct 2024 18:17:55 +0100, by Henry S. Thompson
-
make into a library, entry point def unpackz(infileName, callback, outfile = None),Wed, 02 Oct 2024 19:54:45 +0100, by Henry S. Thompson
-
cleaned up indentation to 2 spaces throughoutWed, 02 Oct 2024 11:09:58 +0100, by Henry S. Thompson
-
take bufsize from cmdlineWed, 02 Oct 2024 09:56:37 +0100, by Henry S. Thompson
-
eof pblms fixed, seems to workTue, 01 Oct 2024 15:59:26 +0100, by Henry S. Thompson
-
working, but last count/offset not being writtenSat, 28 Sep 2024 15:19:05 +0100, by Henry S. Thompson
-
fix error messageThu, 26 Sep 2024 17:54:12 +0100, by Henry S. Thompson
-
csing disabled for nowThu, 26 Sep 2024 12:38:34 +0100, by Henry S. Thompson
-
font hacking, see also lib/xemacs/common-init.elThu, 26 Sep 2024 12:29:27 +0100, by Henry S. Thompson
-
new default from CC themselvesThu, 26 Sep 2024 12:25:54 +0100, by Henry S. Thompson
-
for debugging?Thu, 26 Sep 2024 12:24:16 +0100, by Henry S. Thompson
-
for use in Stuttgart, maybeThu, 09 May 2024 12:36:57 +0100, by Henry S. Thompson
-
xxxSat, 02 Mar 2024 10:59:06 +0000, by Henry S. Thompson
-
mergeThu, 29 Feb 2024 15:01:10 +0000, by Henry S. Thompson
-
post-processingThu, 29 Feb 2024 15:01:02 +0000, by Henry S. Thompson
-
sicWed, 28 Feb 2024 18:31:52 +0000, by Henry S. Thompson
-
compute offset between LM and crawl timestampThu, 29 Feb 2024 14:59:50 +0000, by Henry S. Thompson
-
sicThu, 29 Feb 2024 14:59:09 +0000, by Henry S. Thompson
-
rebuild to match triple fig line colourWed, 28 Feb 2024 15:27:00 +0000, by Henry S. Thompson
-
rebuild with more consistent appearanceWed, 28 Feb 2024 15:13:38 +0000, by Henry S. Thompson
-
mergeWed, 28 Feb 2024 14:50:08 +0000, by Henry S. Thompson
-
replaced mean_lens by w or wo bogonWed, 28 Feb 2024 14:49:45 +0000, by Henry S. Thompson
-
now using clean 2005 countWed, 28 Feb 2024 14:44:59 +0000, by Henry S. Thompson
-
minor addition?Wed, 28 Feb 2024 10:32:01 +0000, by Henry Thompson
-
mergeWed, 28 Feb 2024 10:20:44 +0000, by Henry S. Thompson
-
what is this?Wed, 28 Feb 2024 10:15:56 +0000, by Henry S. Thompson
-
add percentage of non-latin by crawl tableTue, 20 Feb 2024 15:23:47 +0000, by Henry Thompson
-
tld change investigationFri, 16 Feb 2024 16:24:28 +0000, by Henry Thompson
-
nl1 and tld summary resultsFri, 16 Feb 2024 13:54:12 +0000, by Henry S. Thompson
-
correct UsageThu, 15 Feb 2024 22:31:09 +0000, by Henry S. Thompson
-
csing-related tweaksThu, 15 Feb 2024 22:30:40 +0000, by Henry S. Thompson
-
mergeThu, 15 Feb 2024 16:36:00 +0000, by Henry S. Thompson
-
see Paul:Documents/HTalks/WebSci2024Thu, 15 Feb 2024 15:10:34 +0000, by Henry S. Thompson
-
add some debugging infoThu, 11 Jan 2024 16:44:45 +0000, by Henry S. Thompson
-
use 2-digit suffixes,Thu, 11 Jan 2024 16:43:16 +0000, by Henry S. Thompson
-
sicFri, 08 Dec 2023 10:32:07 +0000, by Henry S. Thompson
-
sicThu, 07 Dec 2023 18:23:11 +0000, by Henry S. Thompson
-
added back missing yearsThu, 07 Dec 2023 18:21:48 +0000, by Henry S. Thompson
-
support semilogy from cmd lineThu, 07 Dec 2023 18:15:43 +0000, by Henry S. Thompson
-
means of all columns in length analysesWed, 06 Dec 2023 13:36:49 +0000, by Henry S. Thompson
-
normalise % counts by non-empty bases onlyWed, 06 Dec 2023 13:33:25 +0000, by Henry S. Thompson
-
new plots variousTue, 05 Dec 2023 19:49:29 +0000, by Henry S. Thompson
-
get single graph working, tweak params variousTue, 05 Dec 2023 19:49:11 +0000, by Henry S. Thompson
-
compute (component) uri lengths and a few other propertiesTue, 05 Dec 2023 10:35:15 +0000, by Henry S. Thompson
-
with three tracks from two yearsMon, 04 Dec 2023 19:06:13 +0000, by Henry S. Thompson
-
for pubMon, 04 Dec 2023 10:42:02 +0000, by Henry S. Thompson
-
tweaked formattingMon, 04 Dec 2023 10:40:47 +0000, by Henry S. Thompson
-
excel rewrote, no important changes (?)Mon, 04 Dec 2023 10:21:30 +0000, by Henry S. Thompson
-
replace wrong one with right oneMon, 04 Dec 2023 09:42:39 +0000, by Henry S. Thompson
-
mergeMon, 04 Dec 2023 09:37:14 +0000, by Henry S. Thompson
-
implement alternative confidence measure using stats.bootstrap,Mon, 04 Dec 2023 09:35:53 +0000, by Henry S. Thompson
-
for LMh percentileMon, 04 Dec 2023 09:33:13 +0000, by Henry S. Thompson
-
decoratedThu, 30 Nov 2023 14:42:46 +0000, by Henry S. Thompson
-
mergeThu, 30 Nov 2023 14:20:22 +0000, by Henry S. Thompson
-
can't add props to DescribeResultThu, 30 Nov 2023 14:18:56 +0000, by Henry S. Thompson
-
for 2023-40Thu, 30 Nov 2023 14:17:34 +0000, by Henry S. Thompson
-
with decorationsTue, 28 Nov 2023 18:40:38 +0000, by Henry S. Thompson
-
excel rewrote, no important changes (?)Tue, 28 Nov 2023 18:40:17 +0000, by Henry S. Thompson
-
with percentile instead of raw mean correlTue, 28 Nov 2023 10:23:20 +0000, by Henry S. Thompson
-
change heatmap to by percentileTue, 28 Nov 2023 10:22:38 +0000, by Henry S. Thompson
-
with heatTue, 28 Nov 2023 10:21:36 +0000, by Henry S. Thompson
-
heat map for mime vs. nl1 vs. lenMon, 27 Nov 2023 22:15:39 +0000, by Henry S. Thompson
-
add head_map fnMon, 27 Nov 2023 22:14:53 +0000, by Henry S. Thompson
-
add explore_deltas and predict analysis fnsMon, 27 Nov 2023 18:25:39 +0000, by Henry S. Thompson
-
rename to avoid name clash with scipy.statsSun, 26 Nov 2023 21:24:38 +0000, by Henry S. Thompson
-
move to class with local vars instead of many globalsFri, 24 Nov 2023 20:41:03 +0000, by Henry S. Thompson
-
renamed to by_interval.pyFri, 24 Nov 2023 20:40:09 +0000, by Henry S. Thompson
-
renamed from spearman.pyFri, 24 Nov 2023 20:39:08 +0000, by Henry S. Thompson
-
renamed to stats.pyFri, 24 Nov 2023 20:38:39 +0000, by Henry S. Thompson
-
do the __main__ thingFri, 24 Nov 2023 19:52:52 +0000, by Henry S. Thompson
-
put results in numbered subdirsFri, 24 Nov 2023 19:52:14 +0000, by Henry S. Thompson
-
add minimal logging and don't return until finishedFri, 24 Nov 2023 19:50:12 +0000, by Henry S. Thompson
-
should work for months also nowWed, 15 Nov 2023 10:24:32 +0000, by Henry S. Thompson
-
cross-language confusion :-)Wed, 15 Nov 2023 09:36:23 +0000, by Henry S. Thompson
-
LM plot for multiple crawls, magnitude or %ageMon, 06 Nov 2023 15:55:57 +0000, by Henry S. Thompson
-
can overlay the twoFri, 03 Nov 2023 19:05:54 +0000, by Henry S. Thompson
-
fix output yearThu, 02 Nov 2023 15:38:39 +0000, by Henry S. Thompson
-
sicThu, 02 Nov 2023 13:49:02 +0000, by Henry S. Thompson
-
sicTue, 31 Oct 2023 14:05:12 +0000, by Henry S. Thompson
-
get in/out file management working rightTue, 31 Oct 2023 14:04:24 +0000, by Henry S. Thompson
-
refactor to provide for buffer overflow fixTue, 31 Oct 2023 14:03:02 +0000, by Henry S. Thompson
-
bug-fix wrt 1st time,Tue, 31 Oct 2023 14:01:50 +0000, by Henry S. Thompson
-
make extra file info optionalMon, 30 Oct 2023 12:19:53 +0000, by Henry S. Thompson
-
forget parallel, just do (default 2) parallel single threadsWed, 25 Oct 2023 23:01:59 +0100, by Henry S. Thompson
-
add missing makedirWed, 25 Oct 2023 23:00:45 +0100, by Henry S. Thompson
-
now does one named segment onlyTue, 24 Oct 2023 16:59:23 +0100, by Henry S. Thompson
-
resurrect parallel fetchTue, 24 Oct 2023 16:58:44 +0100, by Henry S. Thompson
-
convert to single thread,Tue, 24 Oct 2023 14:34:58 +0100, by Henry S. Thompson
-
avoid global name conflictTue, 24 Oct 2023 14:26:36 +0100, by Henry S. Thompson
-
moved from /beegfs/common-crawl to get under .hgWed, 11 Oct 2023 12:51:06 +0100, by Henry S. Thompson
-
fix typoWed, 11 Oct 2023 12:50:29 +0100, by Henry S. Thompson
-
build cluster.idxFri, 06 Oct 2023 15:06:53 +0100, by Henry S. Thompson
-
no longer using cmp_to_keyFri, 06 Oct 2023 15:05:55 +0100, by Henry S. Thompson
-
handle -m case, support src from cmdline mergefixWed, 04 Oct 2023 20:04:34 +0100, by Henry S. Thompson
-
new branch to save do_idx.sh from abandoned merge fixup mergefixThu, 05 Oct 2023 10:42:15 +0100, by Henry S. Thompson
-
try to get the counts right, particularly when re-mergingWed, 04 Oct 2023 18:53:55 +0100, by Henry S. Thompson
-
for use in debugging, see notes and tests 2, 17, merge testWed, 04 Oct 2023 18:51:56 +0100, by Henry S. Thompson
-
add various www deletion casesTue, 03 Oct 2023 17:45:57 +0100, by Henry S. Thompson
-
iterate WPAT fix with improved patternTue, 03 Oct 2023 17:44:59 +0100, by Henry S. Thompson
-
loosen WARC pattern to avoid failure from "mime" = "{...}" interveningTue, 03 Oct 2023 17:43:52 +0100, by Henry S. Thompson
-
refactor to enable rerun with fixup,Mon, 02 Oct 2023 18:56:50 +0100, by Henry S. Thompson
-
correct mistaken futnsz test,Mon, 02 Oct 2023 18:55:48 +0100, by Henry S. Thompson
-
change path to merge_date.pyMon, 02 Oct 2023 18:54:10 +0100, by Henry S. Thompson
-
remove the mistaken deletion of NONPRINT,Mon, 02 Oct 2023 18:52:43 +0100, by Henry S. Thompson
-
fix a bad fix and a bad test for the televida caseSat, 30 Sep 2023 18:04:15 +0100, by Henry S. Thompson
-
fix and test for all-decimal hostSat, 30 Sep 2023 14:13:19 +0100, by Henry S. Thompson
-
no import in lmh.__init__ any moreSat, 30 Sep 2023 14:12:39 +0100, by Henry S. Thompson
-
importing in __init__ causes problemsSat, 30 Sep 2023 14:11:49 +0100, by Henry S. Thompson
-
commented out duplicate, handle comments betterFri, 29 Sep 2023 15:59:34 +0100, by Henry S. Thompson