WarcDB: Web crawl data as SQLite databases.
created at May 29, 2022, 11:09 a.m.
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
created at June 28, 2021, 10:46 p.m.
Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.
created at March 9, 2015, 8:32 p.m.
A robust web archive analytics toolkit
created at June 22, 2021, 9:03 a.m.
A list of things related to software, literature, and other content for 🕣 Memento
created at Sept. 16, 2016, 1:33 a.m.
Convert HTTP Archive (HAR) -> Web Archive (WARC) format
created at March 16, 2017, 12:14 a.m.
golang readers for ARC and WARC webarchive formats
created at Sept. 21, 2015, 6:38 a.m.
🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.
created at Sept. 20, 2022, 6:50 p.m.
Zotero extension that combats link rot by archiving webpages and journal articles.
created at Aug. 29, 2019, 5:51 p.m.