A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
created at Oct. 26, 2020, 1:56 p.m.
This project provides Kotlin bindings and several extensions for Apache Spark. We are looking to have this become part of Apache Spark 3.x.
created at June 1, 2020, 11:07 a.m.
C4E, a JVM-friendly library written in Scala for both local and distributed (Spark) clustering.
created at March 26, 2018, 7:58 p.m.
State of the Art Natural Language Processing
created at Sept. 24, 2017, 7:36 p.m.
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
created at July 6, 2017, 10:13 a.m.
Apache Livy is an open-source REST interface for interacting with Apache Spark from anywhere.
created at June 25, 2017, 7 a.m.
Apache Spark testing helpers (dependency-free; works with ScalaTest, uTest, and MUnit)
created at April 6, 2017, 9:40 p.m.
Essential Spark extensions and helper methods ✨😲
created at Feb. 16, 2017, 3:41 p.m.
Apache (Py)Spark type annotations (stub files).
created at Jan. 31, 2017, 1:13 a.m.
Apache Spark datasource for OrientDB
created at Oct. 31, 2016, 2:51 p.m.