comparison lurid3/notes.txt @ 70:db142018ff9e default tip

build _17_ .cdb files
author Henry S. Thompson <ht@inf.ed.ac.uk>
date Tue, 11 Feb 2025 17:10:31 +0000
parents fb3dcd144e59
children
comparison
69:fb3dcd144e59 70:db142018ff9e
1508 95 31385531 1508 95 31385531
1509 >: wc -l ks_divs.tsv 1509 >: wc -l ks_divs.tsv
1510 16 ks_divs.tsv 1510 16 ks_divs.tsv
1511 1511
1512 So, has to be 17 :-( 1512 So, has to be 17 :-(
1513
1514 >: cut -f 1 ../ks_divs.tsv | { i=0; while read j; do printf "%s\t%s\n" $i $j; i=$((j+1)); done ; printf "%s\t%s\n" $i 99 ; } | parallel --colsep "\t" 'echo cat ../\{{1}..{2}\}/ks.tsv \> ks_{1}-{2}.tsv \&'
1515 [couldn't make this work as written, hence the echo, followed by copy-paste]
1516 >: cut -f 1 ../ks_divs.tsv | { i=0; while read j; do printf "ks_%s-%s\n" $i $j; i=$((j+1)); done ; printf "ks_%s-%s\n" $i 99; } | parallel -j 9 '~/lib/python/cc/lmh/ks2cdb.py -f {}.tsv -c {}.cdb_in' &
1517 >: cut -f 1 ../ks_divs.tsv | { i=0; while read j; do printf "ks_%s-%s\n" $i $j; i=$((j+1)); done ; printf "ks_%s-%s\n" $i 99; } | parallel -j 9 'cdbmake {}.cdb {}.tmp <{}.cdb_in' &
1518 >: cut -f 1 ../ks_divs.tsv | { i=0; while read j; do printf "ks_%s-%s\n" $i $j; i=$((j+1)); done ; printf "ks_%s-%s\n" $i 99; } | parallel -j 9 'cdb < {}.cdb' > cdbtest.out &
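
[Sketch, not part of the recorded session: the range generation repeated in the four commands above, factored into a function, plus an explicit loop for the concatenation step so that nothing depends on brace expansion inside the parallel template. Assumptions: column 1 of ks_divs.tsv holds the last segment number of each range, and segments are numbered 0 to 99. The function names ranges and cat_range are made up for illustration. With 16 divider lines this yields 17 ranges, matching the 17 .cdb files.]

```shell
# ranges FILE: print one "start<TAB>end" pair per range.
# Assumes column 1 of FILE is the last segment of each range
# and that the final range always ends at segment 99.
ranges() {
  cut -f 1 "$1" | {
    i=0
    while read -r j; do
      printf '%s\t%s\n' "$i" "$j"
      i=$((j+1))
    done
    printf '%s\t%s\n' "$i" 99
  }
}

# cat_range DIR LO HI: concatenate DIR/LO/ks.tsv ... DIR/HI/ks.tsv
# in numeric order, using a plain loop instead of {LO..HI}.
cat_range() {
  s=$2
  while [ "$s" -le "$3" ]; do
    cat "$1/$s/ks.tsv"
    s=$((s+1))
  done
}

# Possible use (hypothetical, untested against the real data):
#   ranges ../ks_divs.tsv | while read -r lo hi; do
#     cat_range .. "$lo" "$hi" > "ks_${lo}-${hi}.tsv"
#   done
```

[The loop form runs the concatenations one after another; the parallel version above would still be the way to run them concurrently.]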
1513 ================ 1519 ================
1514 1520
1515 Try it with the existing _per segment_ index we have for 2019-35 1521 Try it with the existing _per segment_ index we have for 2019-35
1516 1522
1517 Assuming we have to key on segment / file and offset, as reconstructing the 1523 Assuming we have to key on segment / file and offset, as reconstructing the