har2warc in iipc/awesome-web-archiving

Convert HTTP Archive (HAR) -> Web Archive (WARC) format

updated at March 12, 2024, 12:41 p.m.

Python

7 +0

42 +0

3 +0

GitHub
browsertrix-crawler in iipc/awesome-web-archiving

Run a high-fidelity browser-based crawler in a single Docker container

updated at May 15, 2024, 6:06 p.m.

TypeScript

24 +1

552 +1

72 +3

GitHub
browsertrix in iipc/awesome-web-archiving

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

updated at May 18, 2024, 1:36 a.m.

TypeScript

10 +0

130 +3

27 +1

GitHub
pywb in iipc/awesome-web-archiving

Core Python Web Archiving Toolkit for replay and recording of web archives

updated at May 18, 2024, 6:04 a.m.

JavaScript

61 +0

1,313 +4

207 +0

GitHub
warcio in iipc/awesome-web-archiving

Streaming WARC/ARC library for fast web archive IO

updated at May 18, 2024, 11:21 a.m.

Python

22 +0

349 +3

54 +0

GitHub
archiveweb.page in ipfs/awesome-ipfs

A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!

updated at May 19, 2024, 7:11 a.m.

JavaScript

20 +1

749 +10

54 +1

GitHub