Mercurial > hg > cc > cirrus_home
annotate bin/doC2S.sh @ 150:4e1364b8c89a
working, with compound driver files
author | Henry S. Thompson <ht@inf.ed.ac.uk> |
---|---|
date | Tue, 26 Oct 2021 14:05:35 +0000 |
parents | f0bee28995f1 |
children |
rev | line source |
---|---|
148 | 1 #!/usr/bin/bash |
2 # Usage: doC2S.sh node task cc resdir workd i | |
3 node=$1 | |
4 task=$2 | |
5 cc=$3 | |
6 resdir=$4 | |
7 workd=$5 | |
8 i=$6 | |
9 | |
10 echo "> $node.$task: $i" | |
11 rm -f $workd/cdx$i.db | |
12 cdx2sql.py /beegfs/common_crawl/$cc/cdx/warc $i 2>$workd/cdx$i.errs | \ | |
150
4e1364b8c89a
working, with compound driver files
Henry S. Thompson <ht@inf.ed.ac.uk>
parents:
148
diff
changeset
|
13 /lustre/home/dc007/hst/gentoo/usr/bin/sponge | \ |
4e1364b8c89a
working, with compound driver files
Henry S. Thompson <ht@inf.ed.ac.uk>
parents:
148
diff
changeset
|
14 sqlite3 $workd/cdx$i.db '.read results/cdx.sql' '.mode tabs' '.import /dev/stdin props' '.quit' 2>$workd/cdx$i.log |
148 | 15 echo "< $node.$task: $i"; |
16 | |
17 |