DataStax Connector for Apache Spark to Apache Cassandra
updated at May 24, 2024, 12:26 p.m.
XML data source for Spark SQL and DataFrames
updated at May 23, 2024, 1:15 a.m.
A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
updated at May 22, 2024, 4:13 p.m.
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large-scale machine learning
updated at May 19, 2024, 10:14 a.m.
An implementation of DBSCAN running on top of Apache Spark
updated at May 18, 2024, 7:22 a.m.
Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
updated at May 17, 2024, 4:50 p.m.
This project gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
updated at May 14, 2024, 10:01 p.m.
Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark.
updated at May 14, 2024, 7:19 a.m.