annotate data/00README @ 139:e96d444b0f84

fixed bug(s) wrt large payload files
author Henry S. Thompson <ht@inf.ed.ac.uk>
date Fri, 23 Jul 2021 22:19:15 +0000
parents 128b18459f9e
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
132
Henry S. Thompson <ht@inf.ed.ac.uk>
parents:
diff changeset
1 *CC-MAIN-2019-35/*
Henry S. Thompson <ht@inf.ed.ac.uk>
parents:
diff changeset
2
Henry S. Thompson <ht@inf.ed.ac.uk>
parents:
diff changeset
3 Around 100 sample WARC files, all the index files, index file for some
Henry S. Thompson <ht@inf.ed.ac.uk>
parents:
diff changeset
4 pdfs
Henry S. Thompson <ht@inf.ed.ac.uk>
parents:
diff changeset
5
Henry S. Thompson <ht@inf.ed.ac.uk>
parents:
diff changeset
6 *bin/*
Henry S. Thompson <ht@inf.ed.ac.uk>
parents:
diff changeset
7
Henry S. Thompson <ht@inf.ed.ac.uk>
parents:
diff changeset
8 Release version of various tools
Henry S. Thompson <ht@inf.ed.ac.uk>
parents:
diff changeset
9
Henry S. Thompson <ht@inf.ed.ac.uk>
parents:
diff changeset
10 See 00README files in subdirectories for more information.