har2warc in iipc/awesome-web-archiving

Convert HTTP Archive (HAR) -> Web Archive (WARC) format

updated at March 12, 2024, 12:41 p.m.

Python

7 +0

42 +0

3 +0

GitHub
warcio in iipc/awesome-web-archiving

Streaming WARC/ARC library for fast web archive IO

updated at May 11, 2024, 6:43 a.m.

Python

22 +0

346 +1

54 +0

GitHub
archiveweb.page in ipfs/awesome-ipfs

A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!

updated at May 11, 2024, 10:54 a.m.

JavaScript

19 +0

739 +2

53 +1

GitHub
browsertrix in iipc/awesome-web-archiving

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

updated at May 12, 2024, 11:36 a.m.

TypeScript

10 +0

127 +3

26 +0

GitHub
pywb in iipc/awesome-web-archiving

Core Python Web Archiving Toolkit for replay and recording of web archives

updated at May 12, 2024, 1:47 p.m.

JavaScript

61 +1

1,309 +6

207 +1

GitHub
browsertrix-crawler in iipc/awesome-web-archiving

Run a high-fidelity browser-based crawler in a single Docker container

updated at May 12, 2024, 4:42 p.m.

TypeScript

23 +0

551 +4

69 +1

GitHub