Internet Archive's Sparkling Data Processing Library
updated at April 4, 2024, 12:42 a.m.
WARC and ARC indexing and discovery tools.
updated at March 31, 2024, 2:13 p.m.
Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.
updated at March 26, 2024, 10:50 p.m.
Convert HTTP Archive (HAR) -> Web Archive (WARC) format
updated at March 12, 2024, 12:41 p.m.
Create Robust Links from within Zotero
updated at Feb. 22, 2024, 6:58 p.m.
golang readers for ARC and WARC webarchive formats
updated at Feb. 6, 2024, 11:28 p.m.
Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archives Unleashed Toolkit.
updated at Jan. 21, 2024, 10:04 a.m.
CLI implementation of httpreserve that can test links and retrieve internet archive replacements
updated at Nov. 18, 2023, 5:02 p.m.