pywb by webrecorder

Core Python Web Archiving Toolkit for replay and recording of web archives

updated at May 25, 2024, 3:21 p.m.

JavaScript

61 +0

1,317 +4

206 -1

GitHub
browsertrix-crawler by webrecorder

Run a high-fidelity browser-based crawler in a single Docker container

updated at May 25, 2024, 3:23 p.m.

TypeScript

24 +0

557 +5

72 +0

GitHub
SingleFile by gildas-lormeau

Web Extension for saving a faithful copy of a complete web page in a single HTML file

updated at May 25, 2024, 5:01 p.m.

JavaScript

114 +0

14,008 +55

924 +2

GitHub
badger by dgraph-io

Fast key-value DB in Go.

updated at May 26, 2024, 12:02 a.m.

Go

239 +0

13,457 +12

1,151 +2

GitHub
monolith by Y2Z

⬛️ CLI tool for saving complete web pages as a single HTML file

updated at May 26, 2024, 2:25 a.m.

Rust

62 +0

10,140 +68

287 +2

GitHub
flameshot by flameshot-org

Powerful yet simple to use screenshot software :desktop_computer: :camera_flash:

updated at May 26, 2024, 3:39 a.m.

C++

205 +0

23,358 +42

1,507 +3

GitHub
wikiteam by WikiTeam

Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to tiniest wikis. As of 2023, WikiTeam has preserved more than 350,000 wikis.

updated at May 26, 2024, 3:47 a.m.

Python

40 +0

696 +3

145 +1

GitHub
ArchiveBox by ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

updated at May 26, 2024, 5:37 a.m.

Python

171 +1

20,012 +65

1,089 +3

GitHub