Various tools and bash function sources.

All the tools will give useful output if run with a --help argument.

functions.sh  Source this in your .bashrc to get useful functions,
              including ux, lss and btot

cdx2tsv.py    Extract fields, and subparts of fields, from a CDX-format
              index file

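As a rough illustration of the kind of extraction cdx2tsv.py performs, here is a minimal sketch. It assumes the Common Crawl index layout (SURT key, timestamp, then a JSON object on each line). The function name, the default field list and the sample line are hypothetical, and none of this reflects cdx2tsv.py's actual options or implementation.

```python
import json

# Hypothetical helper, not part of cdx2tsv.py.
def cdx_line_to_tsv(line, fields=("url", "status", "offset", "length", "filename")):
    """Turn one index line into a tab-separated row of the named JSON fields."""
    # Assumed layout: SURT key, timestamp, then a JSON object, space-separated.
    _urlkey, timestamp, payload = line.rstrip("\n").split(" ", 2)
    record = json.loads(payload)
    # Missing fields become empty strings so every row has the same width.
    return "\t".join([timestamp] + [str(record.get(name, "")) for name in fields])

# Made-up sample line, for illustration only.
sample = ('com,example)/ 20230101120000 '
          '{"url": "https://example.com/", "status": "200", '
          '"offset": "1234", "length": "567", "filename": "crawl-data/x.warc.gz"}')
print(cdx_line_to_tsv(sample))
```
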
clm.sh        Intended for use as a sub-command to ix.py: given an
              HTTP response header, appends the Last-Modified value to a
              given file if there is one, otherwise a blank line.

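The behaviour described for clm.sh can be sketched in Python as follows. This is only an illustration of the logic, not the script itself: the function name and the sample headers are hypothetical, and the real script's argument handling is not shown in this README.

```python
import os
import tempfile

# Hypothetical helper illustrating the described behaviour; not clm.sh itself.
def append_last_modified(header_text, out_path):
    """Append the Last-Modified value from header_text to out_path, or a blank line."""
    value = ""
    for line in header_text.splitlines():
        name, sep, rest = line.partition(":")
        # Header names are case-insensitive; the first match wins.
        if sep and name.strip().lower() == "last-modified":
            value = rest.strip()
            break
    # Always write exactly one line, so output lines stay aligned with inputs.
    with open(out_path, "a", encoding="utf-8") as out:
        out.write(value + "\n")
    return value

# Demo with two made-up headers: one with Last-Modified, one without.
with tempfile.TemporaryDirectory() as tmp:
    out_path = os.path.join(tmp, "last_modified.txt")
    append_last_modified(
        "HTTP/1.1 200 OK\r\nLast-Modified: Wed, 21 Oct 2015 07:28:00 GMT\r\n", out_path)
    append_last_modified("HTTP/1.1 200 OK\r\nContent-Type: text/html\r\n", out_path)
    with open(out_path, encoding="utf-8") as f:
        demo_lines = f.read().splitlines()
print(demo_lines)
```

Writing a blank line when the header is absent keeps the output file line-for-line parallel with the sequence of responses processed.
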
ix.py         Efficiently extract some or all of the response data contents
              of Common Crawl WARC-format files

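One common way to pull a single response out of a Common Crawl WARC file is to seek to the record's byte offset, read its compressed length and decompress just that gzip member. The sketch below shows that idea on an in-memory archive. It is an assumption about the general technique, not a description of how ix.py is written, and read_record and make_record are hypothetical names.

```python
import gzip
import io

# Hypothetical helper; ix.py's real implementation is not shown in this README.
def read_record(stream, offset, length):
    """Return (warc_header, http_header, body) for one gzipped record."""
    stream.seek(offset)
    raw = gzip.decompress(stream.read(length))
    warc_header, _, rest = raw.partition(b"\r\n\r\n")
    # Content-Length in the WARC header gives the exact size of the block
    # that follows, which excludes the record's trailing blank lines.
    block_len = None
    for line in warc_header.split(b"\r\n"):
        name, _, value = line.partition(b":")
        if name.strip().lower() == b"content-length":
            block_len = int(value.strip())
    block = rest if block_len is None else rest[:block_len]
    http_header, _, body = block.partition(b"\r\n\r\n")
    return warc_header, http_header, body

def make_record(body):
    """Build a tiny gzipped response record for the demo (made-up content)."""
    http = b"HTTP/1.1 200 OK\r\nContent-Type: text/plain\r\n\r\n" + body
    warc = (b"WARC/1.0\r\nWARC-Type: response\r\nContent-Length: "
            + str(len(http)).encode() + b"\r\n\r\n")
    return gzip.compress(warc + http + b"\r\n\r\n")

first = make_record(b"first")
second = make_record(b"second body")
archive = io.BytesIO(first + second)

# The offset and length would normally come from an index line.
warc_header, http_header, body = read_record(archive, len(first), len(second))
print(body)
```
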
qpdf          Wrapper for locally compiled version.

              Qpdf as supplied only works with a named file, but this
              wrapper supports streamed input.
              _If_ it's invoked as
                qpdf [args...] -
              it takes input from stdin, saves it as /dev/shm/$USER/xxx.pdf
              and runs
                qpdf args... /dev/shm/$USER/xxx.pdf

              Qpdf is the best available PDF validator
              as far as I know.  See
                http://qpdf.sourceforge.net/files/qpdf-manual.html
              for documentation.

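The argument rewriting the qpdf wrapper performs can be sketched like this. It is a Python illustration of the rule described above, not the wrapper's own source: spool_stdin_args, its parameters and the demo values are hypothetical, and the demo writes under a temporary directory instead of /dev/shm.

```python
import io
import os
import tempfile

# Hypothetical helper illustrating the wrapper's rule; not the wrapper itself.
def spool_stdin_args(args, stdin, user, scratch_root="/dev/shm"):
    """If the last argument is "-", save stdin to a file and substitute its path."""
    if not args or args[-1] != "-":
        return list(args)  # named-file invocation: pass through unchanged
    scratch_dir = os.path.join(scratch_root, user)
    os.makedirs(scratch_dir, exist_ok=True)
    pdf_path = os.path.join(scratch_dir, "xxx.pdf")
    with open(pdf_path, "wb") as out:
        out.write(stdin.read())
    return list(args[:-1]) + [pdf_path]

# Demo with made-up input and user name, using a temporary scratch root.
with tempfile.TemporaryDirectory() as root:
    demo_args = spool_stdin_args(
        ["--check", "-"], io.BytesIO(b"%PDF-1.4\n"), "alice", scratch_root=root)
    demo_ok = demo_args == ["--check", os.path.join(root, "alice", "xxx.pdf")]
    unchanged = spool_stdin_args(
        ["--check", "in.pdf"], io.BytesIO(b""), "alice", scratch_root=root)
print(demo_ok, unchanged)
```

In real use the stream would be sys.stdin.buffer, the user name would come from $USER, and the rewritten list would be handed to the locally compiled qpdf, for example with subprocess.run.
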
qpdf_check    Runs qpdf with all the arguments needed to
              make it run as a validator: no corrections are applied,
              no warnings are output, and it
              fails iff there are any errors in the input file.

              Uses the above qpdf wrapper, so supports input either
              from stdin or a named file

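The pass/fail rule for qpdf_check can be sketched as below. The argument list qpdf_check really uses is not given in this README, so the command here is an assumption: --check and --no-warn are real qpdf options, but whether the script uses exactly these is a guess. The exit-status mapping follows qpdf's documented convention of 0 for success, 2 for errors and 3 for warnings only. Both function names are hypothetical.

```python
# Hypothetical helpers; the real qpdf_check argument list is not shown here.
def check_command(pdf_path):
    """Build a plausible validator command line (assumed flags, see above)."""
    return ["qpdf", "--check", "--no-warn", pdf_path]

def passes_validation(exit_status):
    """Pass unless qpdf reported errors: 0 is clean, 3 is warnings only."""
    return exit_status in (0, 3)

for status in (0, 2, 3):
    print(status, passes_validation(status))
```
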