Jupyter magics and kernels for working with remote Spark clusters
created at Sept. 21, 2015, 3:35 p.m.
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
created at Nov. 19, 2013, 11:47 p.m.
Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments
created at April 17, 2015, 7:39 p.m.
Sparkling Water provides H2O functionality inside Spark cluster
created at Oct. 13, 2014, 11:06 p.m.
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
created at June 25, 2017, 7 a.m.
Essential Spark extensions and helper methods ✨😲
created at Feb. 16, 2017, 3:41 p.m.
pyspark methods to enhance developer productivity 📣 👯 🎉
created at Sept. 15, 2017, 1:02 p.m.
XML data source for Spark SQL and DataFrames
created at Nov. 26, 2015, 2:46 a.m.
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
created at June 1, 2020, 11:07 a.m.