Apache Spark datasource for OrientDB
updated at Aug. 3, 2022, 7:26 a.m.
The official Riak Spark Connector for Apache Spark with Riak TS and Riak KV
updated at Sept. 27, 2023, 10:28 a.m.
Stanford CoreNLP wrapper for Apache Spark
updated at Jan. 21, 2024, 2:22 p.m.
C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
updated at Feb. 29, 2024, 4:50 a.m.
A library for time series analysis on Apache Spark
updated at April 24, 2024, 9:39 a.m.
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
updated at May 1, 2024, 4:39 p.m.
REST job server for Apache Spark
updated at May 9, 2024, 3:16 a.m.
Essential Spark extensions and helper methods ✨😲
updated at May 12, 2024, 6:41 p.m.
Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
updated at May 13, 2024, 8:43 a.m.
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
updated at May 13, 2024, 11:56 a.m.
Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
updated at May 17, 2024, 4:50 p.m.
An implementation of DBSCAN runing on top of Apache Spark
updated at May 18, 2024, 7:22 a.m.