pywb in iipc/awesome-web-archiving

Core Python Web Archiving Toolkit for replay and recording of web archives

created at Dec. 9, 2013, 3:30 a.m.

JavaScript

61 +0

1,418 +1

218 +0

GitHub
archiveweb.page in ipfs/awesome-ipfs

A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!

created at Feb. 10, 2020, 8:17 p.m.

TypeScript

19 +0

901 +9

62 +0

GitHub
browsertrix-crawler in iipc/awesome-web-archiving

Run a high-fidelity browser-based web archiving crawler in a single Docker container

created at Nov. 2, 2020, 4:37 a.m.

TypeScript

24 +0

677 +4

86 +1

GitHub
warcio in iipc/awesome-web-archiving

Streaming WARC/ARC library for fast web archive IO

created at March 6, 2017, 6:17 p.m.

Python

22 +0

390 +1

58 +0

GitHub
browsertrix in iipc/awesome-web-archiving

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

created at June 28, 2021, 10:46 p.m.

TypeScript

12 +0

211 +2

37 +1

GitHub
har2warc in iipc/awesome-web-archiving

Convert HTTP Archive (HAR) -> Web Archive (WARC) format

created at March 16, 2017, 12:14 a.m.

Python

7 +0

48 +0

4 +0

GitHub