view master/src/wecu/run.sh @ 58:a3edba8dab11
move to right place in tree
author | Henry S. Thompson <ht@markup.co.uk> |
---|---|
date | Thu, 28 May 2020 09:56:42 +0000 |
parents | master/wecu/run.sh@ac1a20e627a9 |
children |
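# Number of job slots to use on each host, read from cores.txt
cores=`cat cores.txt`
# For every path listed in input_paths, stream the file from Common Crawl on
# one of the machines named in the hosts file, decompress it and run mapper.py
# there; then sort the combined mapper output and reduce it locally
time parallel \
    --sshloginfile hosts \
    --transferfile mapper.py \
    --transferfile reducer.py \
    --will-cite \
    --retries 3 \
    --jobs $cores \
    --workdir $PWD \
    -a input_paths \
    'curl -s -N "https://commoncrawl.s3.amazonaws.com/{}" | unpigz -dp 1 -c | ./mapper.py' | \
    sort | \
    ./reducer.py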
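
The script above has a map, sort, reduce shape: the mappers run in parallel, their output is merged by `sort`, and a single reducer consumes the sorted stream. The following is a minimal local sketch of that shape only, with no network access and no GNU parallel. The `mapper` and `reducer` shell functions are hypothetical stand-ins invented for illustration (a word count); they are not the repository's `mapper.py` and `reducer.py`, whose contents are not shown here.

```shell
#!/bin/sh
# Hypothetical stand-ins for mapper.py and reducer.py (assumptions, not
# taken from the repository): a simple word count.

# mapper: emit one "token<TAB>1" record per whitespace-separated token.
mapper() {
    tr -s ' \t' '\n\n' | awk 'NF { print $1 "\t" 1 }'
}

# reducer: sum the counts of adjacent records that share a key.
# This only works on sorted input, which is why sort sits between the stages.
reducer() {
    awk -F '\t' '
        NR > 1 && $1 != key { print key "\t" total; total = 0 }
        { key = $1; total += $2 }
        END { if (NR > 0) print key "\t" total }
    '
}

# Same pipeline shape as run.sh: map, then sort, then reduce.
result=$(printf 'warc html warc\nhtml warc\n' | mapper | LC_ALL=C sort | reducer)
printf '%s\n' "$result"
```

Because the reducer in this sketch only compares each key with the previous one, removing the `sort` stage would yield several partial counts per key rather than one total. Whether the real `reducer.py` depends on sorted input in the same way is an assumption.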