koalas in awesome-spark/awesome-spark

Koalas: pandas API on Apache Spark

updated at May 30, 2024, 5:09 p.m.

Python

319 +1

3,319 -1

354 -1

spark-csv in awesome-spark/awesome-spark

CSV Data Source for Apache Spark 1.x

updated at May 27, 2024, 7:55 p.m.

Scala

421 +1

1,050 +1

445 +0

spark-sklearn in awesome-spark/awesome-spark

(Deprecated) Scikit-learn integration package for Apache Spark

updated at May 27, 2024, 4:27 p.m.

Python

95 +0

1,076 -1

231 +0

spark-avro in awesome-spark/awesome-spark

Avro Data Source for Apache Spark

updated at May 23, 2024, 12:39 p.m.

Scala

70 +0

539 +0

310 +0

spark-xml in awesome-spark/awesome-spark

XML data source for Spark SQL and DataFrames

updated at May 23, 2024, 1:15 a.m.

Scala

40 +0

487 +0

224 +0

tensorframes in jtoy/awesome-tensorflow

[DEPRECATED] TensorFlow wrapper for DataFrames on Apache Spark

updated at March 20, 2024, 1:37 p.m.

Scala

79 +0

751 +0

162 +0

spark-corenlp in awesome-spark/awesome-spark

Stanford CoreNLP wrapper for Apache Spark

updated at Jan. 21, 2024, 2:22 p.m.

Scala

52 +0

423 +0

120 +0
