diff index.xml @ 4:268fe5fd117f
add slides
field | value |
---|---|
author | Henry Thompson <ht@markup.co.uk> |
date | Wed, 22 May 2024 17:18:23 +0200 |
parents | d6f13dda3a11 |
children | cc5cef8ba548 |
--- a/index.xml Wed May 22 17:14:13 2024 +0200
+++ b/index.xml Wed May 22 17:18:23 2024 +0200
@@ -5,7 +5,7 @@
  <head>
  <title>Augmentations to Common Crawl</title>
  <author>Henry S. Thompson</author>
- <date>15 Apr 2024</date>
+ <date>22 May 2024</date>
  </head>
  <body>
  <div>
@@ -22,6 +22,7 @@
  <item><link href="CC-MAIN-2019-35/cdx/warc/idx/">The directory containing the individual gzipped index files themselves</link>,
  with names of the form <code>cdx-00nnn.gz</code>, for <code>nnn</code>
  in <code>000–299</code></item>
+ <item><link href="Thompson_WebSci24_slides.pdf">WebSci 24 conference slides</link></item>
  </list>
  </div>
  <div>
@@ -32,7 +33,8 @@
 <item term="For the paper"><display>Henry S. Thompson. 2024. "Improved methodology for longitudinal Web analytics using Common Crawl".
 In <emph>ACM Web Science Conference (Websci ’24)</emph>, May 21–24, 2024, Stuttgart, Germany.
 ACM, New York, NY, USA, 11 pages.
-<link href="...">[coming soon]</link> </display><!--https://doi.org/10.1145/3614419.3644018--></item>
+<link href="...">[coming soon]</link>
+</display><!--https://doi.org/10.1145/3614419.3644018--></item>
 <item term="For the data"><display>Henry S. Thompson. 2024.
 <emph>Augmented index for Common Crawl August 2019, with Last-Modified timestamps</emph>.
 <link href="https://markup.co.uk/ccrawl/">https://markup.co.uk/ccrawl/</link>. Retrieved ...</display></item>