id | 40323593 |
name | ArchiveSpark |
full_name | helgeho/ArchiveSpark |
html_url | https://github.com/helgeho/ArchiveSpark |
description | An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive. |
created_at | Aug. 6, 2015, 7:42 p.m. |
updated_at | Nov. 12, 2024, 4:49 a.m. |
pushed_at | Sept. 19, 2024, 11:56 a.m. |
size | 1,221 |
stargazers_count | 145 |
watchers_count | 15 |
forks_count | 19 |
open_issues | 5 |
language | Scala |
awesome_list |
https://github.com/iipc/awesome-web-archiving
|