Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks
updated at April 30, 2024, 6:38 p.m.
A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
updated at April 30, 2024, 9:48 p.m.
Interactive and Reactive Data Science using Scala and Spark.
updated at May 1, 2024, 3:08 p.m.
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
updated at May 1, 2024, 4:39 p.m.
Jupyter magics and kernels for working with remote Spark clusters
updated at May 3, 2024, 11:02 p.m.
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
updated at May 5, 2024, 10:58 a.m.
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
updated at May 6, 2024, 10:14 a.m.
Sparkling Water provides H2O functionality inside Spark cluster
updated at May 8, 2024, 4:42 p.m.