pywb in iipc/awesome-web-archiving

Core Python Web Archiving Toolkit for replay and recording of web archives

updated at Nov. 17, 2024, 1:10 p.m.

JavaScript

61 +0

1,407 +9

218 +1

GitHub
archiveweb.page in ipfs/awesome-ipfs

A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!

updated at Nov. 17, 2024, 4:49 a.m.

TypeScript

19 +0

862 +10

61 +1

GitHub
browsertrix in iipc/awesome-web-archiving

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

updated at Nov. 16, 2024, 9:38 a.m.

TypeScript

12 +0

201 +1

35 +2

GitHub
warcio in iipc/awesome-web-archiving

Streaming WARC/ARC library for fast web archive IO

updated at Nov. 16, 2024, 9:35 a.m.

Python

22 +0

385 +2

58 +0

GitHub
browsertrix-crawler in iipc/awesome-web-archiving

Run a high-fidelity browser-based web archiving crawler in a single Docker container

updated at Nov. 16, 2024, 9:34 a.m.

TypeScript

23 +0

651 +9

83 +1

GitHub
har2warc in iipc/awesome-web-archiving

Convert HTTP Archive (HAR) -> Web Archive (WARC) format

updated at Nov. 12, 2024, 2:57 a.m.

Python

7 +0

46 +1

4 +0

GitHub