Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
updated at May 10, 2024, 5:12 a.m.
XML data source for Spark SQL and DataFrames
updated at May 10, 2024, 3:38 a.m.
Apache Livy is an open-source REST interface for interacting with Apache Spark from anywhere.
updated at May 10, 2024, 3:34 a.m.
Essential Spark extensions and helper methods ✨😲
updated at May 9, 2024, 4:48 p.m.
DataStax Connector for Apache Spark to Apache Cassandra
updated at May 9, 2024, 3:23 a.m.
REST job server for Apache Spark
updated at May 9, 2024, 3:16 a.m.
Sparkling Water provides H2O functionality inside a Spark cluster
updated at May 8, 2024, 4:42 p.m.
Oryx 2: Lambda architecture on Apache Spark and Apache Kafka for real-time, large-scale machine learning
updated at May 6, 2024, 10:14 a.m.
This project provides Kotlin bindings and several extensions for Apache Spark. We are looking to have this included as part of Apache Spark 3.x.
updated at May 5, 2024, 10:58 a.m.
Jupyter magics and kernels for working with remote Spark clusters
updated at May 3, 2024, 11:02 p.m.