elephant-bird by twitter

Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.

updated at March 31, 2024, 2:11 p.m.

Java

189 -1

1,137 +0

390 +0

GitHub
flume-ng-mongodb-sink by leonlee

Flume NG MongoDB source.

updated at March 31, 2024, 2:13 p.m.

Java

13 +0

71 +0

62 +0

GitHub
white-elephant by LinkedInAttic

Hadoop log aggregator and dashboard

updated at March 31, 2024, 2:13 p.m.

Java

97 +0

190 +0

63 +0

GitHub
suro by Netflix

Netflix's distributed Data Pipeline

updated at March 31, 2024, 2:13 p.m.

Java

508 +0

789 +0

168 +0

GitHub
hadoopy by bwhite

Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.

updated at April 2, 2024, 5:40 p.m.

C

23 +0

243 +0

59 +0

GitHub
hdfs-du by twitter-archive

Visualize your HDFS cluster usage

updated at April 2, 2024, 5:41 p.m.

JavaScript

138 +0

231 +0

87 +0

GitHub
banana by lucidworks

Banana for Solr - A Port of Kibana

updated at April 2, 2024, 5:41 p.m.

JavaScript

201 +0

670 +0

237 +0

GitHub
ankush by Impetus

A big data cluster management tool that creates and manages clusters of different technologies.

updated at April 2, 2024, 5:41 p.m.

Java

13 +0

21 +0

17 +0

GitHub
packetpig by packetloop

Packetpig - Open Source Big Data Security Analytics

updated at April 6, 2024, 2:42 a.m.

Python

57 +0

298 +0

86 +0

GitHub
haeinsa by VCNC

Haeinsa is linearly scalable multi-row, multi-table transaction library for HBase

updated at April 6, 2024, 6:01 a.m.

Java

30 +0

158 +0

47 +0

GitHub
HiveSwarm by livingsocial

Helpful user defined fuctions / table generating functions for Hive

updated at April 8, 2024, 12:12 a.m.

Java

66 +0

101 +0

46 +0

GitHub
inviso by Netflix

None

updated at April 9, 2024, 3:13 a.m.

JavaScript

456 +0

205 +0

72 +0

GitHub
OdbcHive by recruitcojp

Hive ODBC driver for Windows

updated at April 10, 2024, 12:43 a.m.

C++

3 +0

8 +0

8 +0

GitHub
registry by hortonworks

Schema Registry

updated at April 10, 2024, 1:30 p.m.

Java

203 +0

12 +0

7 +0

GitHub
shib by tagomoris

WebUI for query engines: Hive and Presto

updated at April 19, 2024, 10:53 a.m.

JavaScript

28 +0

198 +0

56 +0

GitHub
Lipstick by Netflix

Pig Visualization framework

updated at April 22, 2024, 8:35 p.m.

JavaScript

491 +0

465 +0

132 +0

GitHub
schema-registry-ui by Landoop

Web tool for Avro Schema Registry |

updated at April 24, 2024, 12:15 p.m.

JavaScript

36 +0

415 +0

112 +0

GitHub
HiBench by Intel-bigdata

HiBench is a big data benchmark suite.

updated at May 1, 2024, 10:35 a.m.

Java

126 +0

1,433 +0

756 +0

GitHub
crunch by jondot

A fast to develop, fast to run, Go based toolkit for ETL and feature extraction on Hadoop.

updated at May 4, 2024, 2:49 a.m.

Go

18 +0

213 +0

16 +0

GitHub
oryx by OryxProject

Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

updated at May 6, 2024, 10:14 a.m.

Java

209 +0

1,789 +1

405 +0

GitHub