flameshot by flameshot-org

Powerful yet simple to use screenshot software :desktop_computer: :camera_flash:

created at May 10, 2017, 7:44 p.m.

C++

207 +1

25,019 +48

1,599 -1

GitHub
ArchiveBox by ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

created at May 5, 2017, 8:50 a.m.

Python

174 +0

22,334 +88

1,184 +6

GitHub
SingleFile by gildas-lormeau

Web Extension for saving a faithful copy of a complete web page in a single HTML file

created at Sept. 12, 2010, 11:50 p.m.

JavaScript

118 +0

15,680 +45

1,017 +3

GitHub
badger by dgraph-io

Fast key-value DB in Go.

created at Jan. 26, 2017, 5:09 a.m.

Go

231 +0

13,976 +32

1,184 +2

GitHub
monolith by Y2Z

⬛️ CLI tool for saving complete web pages as a single HTML file

created at Feb. 20, 2017, 7:47 a.m.

Rust

62 +0

11,217 +32

315 +1

GitHub
chrome-remote-interface by cyrus-and

Chrome Debugging Protocol interface for Node.js

created at April 17, 2013, 6 p.m.

JavaScript

80 +1

4,294 +6

310 +1

GitHub
dn by dosyago

💾 dn - offline full-text search and archiving for your Chromium-based browser.

created at Dec. 20, 2019, 9:47 a.m.

JavaScript

43 +0

3,784 +1

145 +0

GitHub
xdotool by jordansissel

fake keyboard/mouse input, window management, and more

created at Feb. 16, 2011, 2:41 a.m.

C

61 +0

3,263 +12

321 +2

GitHub
wayback by wabarc

An archiving tool with an IM-style interface that prioritizes privacy and accessibility, integrated with various archival services including Internet Archive, archive.today, Ghostarchive, IPFS, Telegraph, and file systems.

created at June 13, 2020, 10:08 a.m.

Go

10 +0

1,814 +10

64 +0

GitHub
internetarchive by jjjake

A Python and Command-Line Interface to Archive.org

created at Aug. 15, 2012, 7:18 p.m.

Python

56 +0

1,625 +9

219 +1

GitHub
pywb by webrecorder

Core Python Web Archiving Toolkit for replay and recording of web archives

created at Dec. 9, 2013, 3:30 a.m.

JavaScript

61 +0

1,407 +9

218 +1

GitHub
grab-site by ArchiveTeam

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

created at Feb. 5, 2015, 5:01 a.m.

Python

40 +0

1,398 +4

135 +0

GitHub
twarc by DocNow

A command line tool (and Python library) for archiving Twitter JSON

created at Jan. 14, 2013, 2:35 p.m.

Python

35 +0

1,370 +0

255 +0

GitHub
wikiteam by WikiTeam

Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to tiniest wikis. As of 2024, WikiTeam has preserved more than 600,000 wikis.

created at June 25, 2014, 10:18 a.m.

Python

40 +0

729 +0

149 +0

GitHub
brozzler by internetarchive

brozzler - distributed browser-based web crawler

created at July 13, 2015, 11:48 p.m.

Python

40 +0

671 +2

97 +0

GitHub
browsertrix-crawler by webrecorder

Run a high-fidelity browser-based web archiving crawler in a single Docker container

created at Nov. 2, 2020, 4:37 a.m.

TypeScript

23 +0

651 +9

83 +1

GitHub
ipwb by oduwsdl

InterPlanetary Wayback: A distributed and persistent archive replay system using IPFS

created at March 4, 2016, 3:01 p.m.

Python

23 +0

617 +1

39 +0

GitHub
auto-archiver by bellingcat

Automatically archive links to videos, images, and social media content from Google Sheets (and more).

created at Jan. 15, 2021, 10:30 a.m.

Python

22 +1

570 +4

60 +1

GitHub
wpull by ArchiveTeam

Wget-compatible web downloader and crawler.

created at Dec. 7, 2013, 1:03 p.m.

HTML

23 +0

557 +2

77 +0

GitHub
awesome-website-change-monitoring by edgi-govdata-archiving

A curated list of awesome tools for website diffing and change monitoring.

created at May 24, 2017, 5:33 a.m.

Unknown languages

30 +0

495 +1

31 +0

GitHub