A list of things related to software, literature, and other content for 🕣 Memento
created at Sept. 16, 2016, 1:33 a.m.
Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.
created at March 9, 2015, 8:32 p.m.
Java application to download WARCs from WASAPI
created at April 28, 2017, 9:15 p.m.
An Apache Spark framework for easy data processing, extraction, and derivation for web archives and archival collections, developed at the Internet Archive.
created at Aug. 6, 2015, 7:42 p.m.
A collection of tools for archiving and analysing the internet.
created at Jan. 14, 2015, 6:53 p.m.
A Rails engine supporting the discovery of web archives.
created at Aug. 3, 2017, 5:45 p.m.
Web Extension for saving a faithful copy of a complete web page in a single HTML file
created at Sept. 12, 2010, 11:50 p.m.
A search interface and wayback machine for the UKWA Solr-based warc-indexer framework.
created at Feb. 8, 2017, 9:33 a.m.
Command-line tools and libraries for handling and manipulating WARC files (and HTTP contents)
created at March 22, 2013, 8:52 p.m.
A command-line tool and Python library for archiving data from Facebook using the Graph API.
created at Feb. 14, 2017, 11:45 p.m.
A Dockerized, queued, high-fidelity web archiver based on Squidwarc
created at July 21, 2018, 8:31 a.m.