browsertrix-crawler by webrecorder

Run a high-fidelity browser-based crawler in a single Docker container

created at Nov. 2, 2020, 4:37 a.m.

TypeScript

23 +0

551 +4

69 +1

GitHub
freeze-dry by WebMemex

Snapshots a web page to get it as a static, self-contained HTML document.

created at July 13, 2017, 11:31 p.m.

TypeScript

11 +0

267 +0

18 +0

GitHub
browsertrix by webrecorder

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

created at June 28, 2021, 10:46 p.m.

TypeScript

10 +0

127 +3

26 +0

GitHub
cairn by wabarc

NPM package and CLI tool for saving web page as single HTML file

created at Oct. 8, 2020, 7:18 a.m.

TypeScript

4 +0

37 +0

2 +0

GitHub