REST job server for Apache Spark
updated at May 9, 2024, 3:16 a.m.
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
updated at May 10, 2024, 5:12 a.m.
Essential Spark extensions and helper methods ✨😲
updated at May 12, 2024, 6:41 p.m.
Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
updated at May 13, 2024, 8:43 a.m.
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
updated at May 13, 2024, 11:56 a.m.
Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark.
updated at May 14, 2024, 7:19 a.m.
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
updated at May 14, 2024, 10:01 p.m.
Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
updated at May 17, 2024, 4:50 p.m.
An implementation of DBSCAN runing on top of Apache Spark
updated at May 18, 2024, 7:22 a.m.
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
updated at May 19, 2024, 10:14 a.m.
A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
updated at May 22, 2024, 4:13 p.m.