grab-site in iipc/awesome-web-archiving

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

updated at May 19, 2024, 9:13 p.m.

Python

40 +0

1,273 +1

125 +0

GitHub
wpull in iipc/awesome-web-archiving

Wget-compatible web downloader and crawler.

updated at May 24, 2024, 4:29 a.m.

HTML

23 +0

538 +2

77 +0

GitHub