Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
updated at June 5, 2024, 8:40 a.m.
Base classes to use when writing tests with Spark
updated at June 4, 2024, 6:53 p.m.
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
updated at June 4, 2024, 11:23 a.m.
Sparkling Water provides H2O functionality inside Spark cluster
updated at June 4, 2024, 1:38 a.m.
XML data source for Spark SQL and DataFrames
updated at June 2, 2024, 10:47 p.m.
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
updated at June 1, 2024, 5:45 p.m.
A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
updated at May 31, 2024, 2:15 p.m.
Essential Spark extensions and helper methods ✨😲
updated at May 30, 2024, 2:58 p.m.
DataStax Connector for Apache Spark to Apache Cassandra
updated at May 28, 2024, 4:10 p.m.
A library for time series analysis on Apache Spark
updated at May 28, 2024, 3:01 a.m.
Interactive and Reactive Data Science using Scala and Spark.
updated at May 28, 2024, 1:47 a.m.