Thrift-API-HiveClient2 by dmorel

Perl to HiveServer2 Thrift API wrapper

created at Dec. 9, 2016, 10:34 p.m.

Perl

2 +0

0 +0

1 +0

GitHub
registry by hortonworks

Schema Registry

created at Oct. 26, 2016, 8:28 a.m.

Java

203 +0

12 +0

7 +0

GitHub
schema-registry-ui by Landoop

Web tool for Avro Schema Registry |

created at June 12, 2016, 1:01 p.m.

JavaScript

36 +0

415 +0

112 +0

GitHub
airflow by apache

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

created at April 13, 2015, 6:04 p.m.

Python

755 +2

34,583 +70

13,573 +23

GitHub
schema-registry by confluentinc

Confluent Schema Registry for Kafka

created at Dec. 9, 2014, 10:38 p.m.

Java

368 +0

2,139 +1

1,101 +2

GitHub
gobblin by apache

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

created at Dec. 1, 2014, 6:10 p.m.

Java

167 +0

2,190 +0

742 +0

GitHub
crunch by jondot

A fast to develop, fast to run, Go based toolkit for ETL and feature extraction on Hadoop.

created at Nov. 18, 2014, 7:17 p.m.

Go

18 +0

213 +0

16 +0

GitHub
hdfs by colinmarc

A native go client for HDFS

created at Oct. 8, 2014, 7:37 p.m.

Go

39 +0

1,347 +2

341 +0

GitHub
inviso by Netflix

None

created at July 29, 2014, 7:28 p.m.

JavaScript

456 +1

205 +0

72 +0

GitHub
oryx by OryxProject

Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

created at July 25, 2014, 8:08 p.m.

Java

209 +0

1,788 +1

405 +0

GitHub
ankush by Impetus

A big data cluster management tool that creates and manages clusters of different technologies.

created at May 29, 2014, 10:05 a.m.

Java

13 +0

21 +0

17 +0

GitHub
flume-udp-source by whitepages

Apache Flume source plugin allowing direct consumption of UDP messages

created at March 18, 2014, 11:32 p.m.

Java

4 +0

8 +0

9 +0

GitHub
PyHive by dropbox

Python interface to Hive and Presto. 🐝

created at Feb. 1, 2014, 9:05 a.m.

Python

62 +0

1,665 +0

552 +0

GitHub
PigPen by Netflix

Map-Reduce for Clojure

created at Dec. 12, 2013, 10:56 p.m.

Clojure

466 +1

558 +1

55 +0

GitHub
Beetest by kawaa

A super simple utility for testing Apache Hive scripts locally for non-Java developers.

created at Dec. 7, 2013, 6:17 p.m.

Java

8 +0

71 +0

23 +0

GitHub
HiveRunner by HiveRunner

An Open Source unit test framework for Hive queries based on JUnit 4 and 5

created at Nov. 22, 2013, 9:19 a.m.

Java

34 +0

252 +0

79 +0

GitHub
banana by lucidworks

Banana for Solr - A Port of Kibana

created at Nov. 21, 2013, 5:30 p.m.

JavaScript

201 +0

670 +0

237 +0

GitHub
haeinsa by VCNC

Haeinsa is linearly scalable multi-row, multi-table transaction library for HBase

created at Aug. 10, 2013, 3:43 p.m.

Java

30 +0

158 +0

47 +0

GitHub
hindex by Huawei-Hadoop

Secondary Index for HBase

created at Aug. 8, 2013, 11:33 a.m.

Java

134 +0

589 +0

289 +0

GitHub
genie by Netflix

Distributed Big Data Orchestration Service

created at June 20, 2013, 8:35 p.m.

Java

521 +1

1,681 -1

365 +0

GitHub