chrome-remote-interface by cyrus-and

Chrome Debugging Protocol interface for Node.js

created at April 17, 2013, 6 p.m.

JavaScript

81 +0

4,192 +3

300 +1

GitHub
xdotool by jordansissel

fake keyboard/mouse input, window management, and more

created at Feb. 16, 2011, 2:41 a.m.

C

56 +0

3,047 +10

311 +1

GitHub
badger by dgraph-io

Fast key-value DB in Go.

created at Jan. 26, 2017, 5:09 a.m.

Go

240 +0

13,427 +23

1,149 +2

GitHub
aut by archivesunleashed

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

created at July 6, 2017, 10:13 a.m.

Scala

15 +0

133 +0

33 +0

GitHub
MementoMap by oduwsdl

A Tool to Summarize Web Archive Holdings

created at Jan. 20, 2019, 1:30 a.m.

Python

7 +0

9 +0

0 +0

GitHub
warcio by webrecorder

Streaming WARC/ARC library for fast web archive IO

created at March 6, 2017, 6:17 p.m.

Python

22 +0

346 +1

54 +0

GitHub
har2warc by webrecorder

Convert HTTP Archive (HAR) -> Web Archive (WARC) format

created at March 16, 2017, 12:14 a.m.

Python

7 +0

42 +0

3 +0

GitHub
webarchive-discovery by ukwa

WARC and ARC indexing and discovery tools.

created at Dec. 20, 2012, 12:17 p.m.

Java

24 +0

113 +0

24 +0

GitHub
archivenow by oduwsdl

A Tool To Push Web Resources Into Web Archives

created at Feb. 9, 2017, 12:29 p.m.

Python

21 +0

391 +0

41 +0

GitHub
chronicler by CGamesPlay

Offline-first web browser

created at Dec. 27, 2018, 4:01 a.m.

JavaScript

6 +0

83 +0

5 +0

GitHub
wail by N0taN3rd

whale2 One-Click User Instigated Preservation

created at May 26, 2016, 4:52 a.m.

JavaScript

13 +1

120 +1

9 +0

GitHub
awesome-website-change-monitoring by edgi-govdata-archiving

A curated list of awesome tools for website diffing and change monitoring.

created at May 24, 2017, 5:33 a.m.

Unknown languages

31 +0

481 +0

31 +0

GitHub
HadoopConcatGz by helgeho

A Splitable Hadoop InputFormat for Concatenated GZIP Files and *.(w)arc.gz

created at Aug. 8, 2016, 1:36 p.m.

Java

2 +0

9 +0

3 +0

GitHub
grab-site by ArchiveTeam

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

created at Feb. 5, 2015, 5:01 a.m.

Python

40 +0

1,270 +6

125 +3

GitHub
wikiteam by WikiTeam

Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to tiniest wikis. As of 2023, WikiTeam has preserved more than 350,000 wikis.

created at June 25, 2014, 10:18 a.m.

Python

40 +0

692 +2

144 +0

GitHub
WarcPartitioner by helgeho

Partition (W)ARC Files by MIME Type and Year

created at Feb. 13, 2017, 3:45 p.m.

Java

2 +0

1 +0

1 +0

GitHub
crocoite by PromyLOPh

Web archiving using Google Chrome

created at Nov. 17, 2017, 6:56 p.m.

Python

8 +0

42 +0

7 +0

GitHub
wasp by webis-de

None

created at March 25, 2018, 6:58 p.m.

Java

13 +0

25 +0

4 +0

GitHub
wpull by ArchiveTeam

Wget-compatible web downloader and crawler.

created at Dec. 7, 2013, 1:03 p.m.

HTML

23 +0

536 +1

77 +1

GitHub
ipwb by oduwsdl

InterPlanetary Wayback: A distributed and persistent archive replay system using IPFS

created at March 4, 2016, 3:01 p.m.

Python

23 +0

590 +0

39 +0

GitHub