id | 40323593 |
name | ArchiveSpark |
full_name | helgeho/ArchiveSpark |
html_url | https://github.com/helgeho/ArchiveSpark |
description | An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive. |
created_at | Aug. 6, 2015, 7:42 p.m. |
updated_at | May 5, 2024, 4:14 a.m. |
pushed_at | April 4, 2024, 10:26 a.m. |
size | 1,204 |
stargazers_count | 141 |
watchers_count | 14 |
forks_count | 19 |
open_issues | 4 |
language | Scala |
awesome_list |
https://github.com/iipc/awesome-web-archiving
|