keyvalue
id40323593
nameArchiveSpark
full_namehelgeho/ArchiveSpark
html_urlhttps://github.com/helgeho/ArchiveSpark
descriptionAn Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
created_atAug. 6, 2015, 7:42 p.m.
updated_atMay 5, 2024, 4:14 a.m.
pushed_atApril 4, 2024, 10:26 a.m.
size1,204
stargazers_count141
watchers_count14
forks_count19
open_issues4
languageScala
awesome_list

https://github.com/iipc/awesome-web-archiving