warcio in iipc/awesome-web-archiving

Streaming WARC/ARC library for fast web archive IO

created at March 6, 2017, 6:17 p.m.

Python

22 +0

385 +2

58 +0

GitHub
har2warc in iipc/awesome-web-archiving

Convert HTTP Archive (HAR) -> Web Archive (WARC) format

created at March 16, 2017, 12:14 a.m.

Python

7 +0

46 +1

4 +0

GitHub
pywb in iipc/awesome-web-archiving

Core Python Web Archiving Toolkit for replay and recording of web archives

created at Dec. 9, 2013, 3:30 a.m.

JavaScript

61 +0

1,407 +9

218 +1

GitHub
browsertrix-crawler in iipc/awesome-web-archiving

Run a high-fidelity browser-based web archiving crawler in a single Docker container

created at Nov. 2, 2020, 4:37 a.m.

TypeScript

23 +0

651 +9

83 +1

GitHub
archiveweb.page in ipfs/awesome-ipfs

A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!

created at Feb. 10, 2020, 8:17 p.m.

TypeScript

19 +0

862 +10

61 +1

GitHub
browsertrix in iipc/awesome-web-archiving

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

created at June 28, 2021, 10:46 p.m.

TypeScript

12 +0

201 +1

35 +2

GitHub