keyvalue
id40323593
nameArchiveSpark
full_namehelgeho/ArchiveSpark
html_urlhttps://github.com/helgeho/ArchiveSpark
descriptionAn Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
created_atAug. 6, 2015, 7:42 p.m.
updated_atNov. 12, 2024, 4:49 a.m.
pushed_atSept. 19, 2024, 11:56 a.m.
size1,221
stargazers_count145
watchers_count15
forks_count19
open_issues5
languageScala
awesome_list

https://github.com/iipc/awesome-web-archiving