pywb in iipc/awesome-web-archiving

Core Python Web Archiving Toolkit for replay and recording of web archives

created at Dec. 9, 2013, 3:30 a.m.

JavaScript

60 +0

1,300 +4

204 +1

GitHub
archiveweb.page in ipfs/awesome-ipfs

A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!

created at Feb. 10, 2020, 8:17 p.m.

JavaScript

19 +0

728 +2

51 +1

GitHub
browsertrix-crawler in iipc/awesome-web-archiving

Run a high-fidelity browser-based crawler in a single Docker container

created at Nov. 2, 2020, 4:37 a.m.

TypeScript

23 +0

538 +6

67 +2

GitHub
warcio in iipc/awesome-web-archiving

Streaming WARC/ARC library for fast web archive IO

created at March 6, 2017, 6:17 p.m.

Python

22 +0

342 +2

54 +0

GitHub
browsertrix-cloud in iipc/awesome-web-archiving

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

created at June 28, 2021, 10:46 p.m.

TypeScript

10 +0

119 +1

24 +1

GitHub
har2warc in iipc/awesome-web-archiving

Convert HTTP Archive (HAR) -> Web Archive (WARC) format

created at March 16, 2017, 12:14 a.m.

Python

7 +0

42 +0

3 +0

GitHub