Chrome Debugging Protocol interface for Node.js
created at April 17, 2013, 6 p.m.
Fake keyboard/mouse input, window management, and more
created at Feb. 16, 2011, 2:41 a.m.
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
created at July 6, 2017, 10:13 a.m.
Streaming WARC/ARC library for fast web archive IO
created at March 6, 2017, 6:17 p.m.
Convert HTTP Archive (HAR) -> Web Archive (WARC) format
created at March 16, 2017, 12:14 a.m.
WARC and ARC indexing and discovery tools.
created at Dec. 20, 2012, 12:17 p.m.
A Tool To Push Web Resources Into Web Archives
created at Feb. 9, 2017, 12:29 p.m.
A curated list of awesome tools for website diffing and change monitoring.
created at May 24, 2017, 5:33 a.m.
A Splittable Hadoop InputFormat for Concatenated GZIP Files and *.(w)arc.gz
created at Aug. 8, 2016, 1:36 p.m.
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
created at Feb. 5, 2015, 5:01 a.m.
Partition (W)ARC Files by MIME Type and Year
created at Feb. 13, 2017, 3:45 p.m.