crunch by jondot

A fast to develop, fast to run, Go based toolkit for ETL and feature extraction on Hadoop.

updated at May 4, 2024, 2:49 a.m.

Go

18 +0

213 +0

16 +0

GitHub
schema-registry-ui by Landoop

Web tool for Avro Schema Registry |

updated at April 24, 2024, 12:15 p.m.

JavaScript

36 +0

415 +0

112 +0

GitHub
Lipstick by Netflix

Pig Visualization framework

updated at April 22, 2024, 8:35 p.m.

JavaScript

492 +1

465 +0

132 +0

GitHub
registry by hortonworks

Schema Registry

updated at April 10, 2024, 1:30 p.m.

Java

202 -1

12 +0

7 +0

GitHub
OdbcHive by recruitcojp

Hive ODBC driver for Windows

updated at April 10, 2024, 12:43 a.m.

C++

3 +0

8 +0

8 +0

GitHub
inviso by Netflix

None

updated at April 9, 2024, 3:13 a.m.

JavaScript

457 +1

205 +0

72 +0

GitHub
haeinsa by VCNC

Haeinsa is linearly scalable multi-row, multi-table transaction library for HBase

updated at April 6, 2024, 6:01 a.m.

Java

30 +0

158 +0

47 +0

GitHub
packetpig by packetloop

Packetpig - Open Source Big Data Security Analytics

updated at April 6, 2024, 2:42 a.m.

Python

57 +0

298 +0

86 +0

GitHub
ankush by Impetus

A big data cluster management tool that creates and manages clusters of different technologies.

updated at April 2, 2024, 5:41 p.m.

Java

13 +0

21 +0

17 +0

GitHub
banana by lucidworks

Banana for Solr - A Port of Kibana

updated at April 2, 2024, 5:41 p.m.

JavaScript

201 +0

670 +0

237 +0

GitHub
hdfs-du by twitter-archive

Visualize your HDFS cluster usage

updated at April 2, 2024, 5:41 p.m.

JavaScript

138 +0

231 +0

87 +0

GitHub
hadoopy by bwhite

Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.

updated at April 2, 2024, 5:40 p.m.

C

23 +0

243 +0

59 +0

GitHub
white-elephant by LinkedInAttic

Hadoop log aggregator and dashboard

updated at March 31, 2024, 2:13 p.m.

Java

97 +0

190 +0

63 +0

GitHub
flume-ng-mongodb-sink by leonlee

Flume NG MongoDB source.

updated at March 31, 2024, 2:13 p.m.

Java

13 +0

71 +0

62 +0

GitHub
elephant-bird by twitter

Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.

updated at March 31, 2024, 2:11 p.m.

Java

189 +0

1,137 +0

390 +0

GitHub
hannibal by sentric

Hannibal is tool to help monitor and maintain HBase-Clusters that are configured for manual splitting.

updated at March 25, 2024, 11:52 a.m.

Ruby

43 +0

171 +0

60 +0

GitHub
happybase by python-happybase

A developer-friendly Python library to interact with Apache HBase

updated at Feb. 20, 2024, 1:12 p.m.

Python

35 +0

609 +0

162 +0

GitHub
mpich2-yarn by alibaba

Running MPICH2 on Yarn

updated at Dec. 19, 2023, 7:30 a.m.

Java

34 +0

114 +0

62 +0

GitHub
varaha by thedatachef

Machine learning and natural language processing with Apache Pig

updated at Oct. 14, 2023, 8:49 a.m.

Java

9 +0

53 +0

15 +0

GitHub
akela by mozilla-metrics

A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.

updated at Oct. 6, 2023, 4:55 p.m.

Java

23 +0

76 +0

31 +0

GitHub