Essential Spark extensions and helper methods ✨😲
updated at Nov. 8, 2024, 2:27 a.m.
Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
updated at Nov. 8, 2024, 12:32 p.m.
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
updated at Nov. 10, 2024, 6:49 a.m.
pyspark methods to enhance developer productivity 📣 👯 🎉
updated at Nov. 11, 2024, 9:29 a.m.
Jupyter magics and kernels for working with remote Spark clusters
updated at Nov. 14, 2024, 5:19 a.m.
Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
updated at Nov. 14, 2024, 9:10 a.m.
Base classes to use when writing tests with Spark
updated at Nov. 15, 2024, 9:20 a.m.
Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments
updated at Nov. 15, 2024, 9:25 a.m.
State of the Art Natural Language Processing
updated at Nov. 15, 2024, 2:29 p.m.
Sparkling Water provides H2O functionality inside Spark cluster
updated at Nov. 15, 2024, 8:11 p.m.