grab-site in iipc/awesome-web-archiving

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

created at Feb. 5, 2015, 5:01 a.m.

Python

40 +0

1,270 +6

125 +3

GitHub

wpull in iipc/awesome-web-archiving

Wget-compatible web downloader and crawler.

created at Dec. 7, 2013, 1:03 p.m.

HTML

23 +0

536 +1

77 +1

GitHub