hive_cassandra_udfs by edwardcapriolo

User Defined Functions for Hive to work with Cassandra

updated at Oct. 15, 2015, 1:59 a.m.

Java

4 +0

11 +0

5 +0

GitHub
accumulo-hive-storage-manager by bfemiano

Working commits for Hive connector to Accumulo. This will eventually be checked directly into Accumulo.

updated at Nov. 19, 2020, 9:41 p.m.

Java

6 +0

13 +0

12 +0

GitHub
Hive-Cassandra by dvasilen

Hive Storage Handler for Cassandra (cloned from https://github.com/riptano/hive/tree/hive-0.8.1-merge/cassandra-handler)

updated at Jan. 28, 2022, 4:46 p.m.

Java

9 +0

15 +0

14 +0

GitHub
gdata-storagehandler by balshor

A Hive StorageHandler that uses a Google Spreadsheet as a backend.

updated at Feb. 11, 2022, 5:07 a.m.

Java

3 +0

14 +0

4 +0

GitHub
Hive-Extensions-from-Think-Big-Analytics by ThinkBigAnalytics

Reusable code for Hive

updated at April 1, 2022, 2 a.m.

Java

316 +0

16 +0

14 +0

GitHub
flume-udp-source by whitepages

Apache Flume source plugin allowing direct consumption of UDP messages

updated at Jan. 28, 2023, 9:42 a.m.

Java

4 +0

8 +0

9 +0

GitHub
ls-hive by lovelysystems

Lovely Systems Hive Goodies

updated at Jan. 28, 2023, 7:23 p.m.

Java

16 +0

5 +0

2 +0

GitHub
flume-ng-rabbitmq by jcustenborder

Flume plugin for RabbitMQ

updated at Aug. 18, 2023, 12:35 a.m.

Java

10 +0

59 +0

46 +0

GitHub
akela by mozilla-metrics

A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.

updated at Oct. 6, 2023, 4:55 p.m.

Java

23 +0

76 +0

31 +0

GitHub
varaha by thedatachef

Machine learning and natural language processing with Apache Pig

updated at Oct. 14, 2023, 8:49 a.m.

Java

9 +0

53 +0

15 +0

GitHub
mpich2-yarn by alibaba

Running MPICH2 on Yarn

updated at Dec. 19, 2023, 7:30 a.m.

Java

34 +0

114 +0

62 +0

GitHub
elephant-bird by twitter

Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.

updated at March 31, 2024, 2:11 p.m.

Java

189 -1

1,137 +0

390 +0

GitHub
flume-ng-mongodb-sink by leonlee

Flume NG MongoDB source.

updated at March 31, 2024, 2:13 p.m.

Java

13 +0

71 +0

62 +0

GitHub
white-elephant by LinkedInAttic

Hadoop log aggregator and dashboard

updated at March 31, 2024, 2:13 p.m.

Java

97 +0

190 +0

63 +0

GitHub
suro by Netflix

Netflix's distributed Data Pipeline

updated at March 31, 2024, 2:13 p.m.

Java

508 +0

789 +0

168 +0

GitHub
ankush by Impetus

A big data cluster management tool that creates and manages clusters of different technologies.

updated at April 2, 2024, 5:41 p.m.

Java

13 +0

21 +0

17 +0

GitHub
haeinsa by VCNC

Haeinsa is linearly scalable multi-row, multi-table transaction library for HBase

updated at April 6, 2024, 6:01 a.m.

Java

30 +0

158 +0

47 +0

GitHub
HiveSwarm by livingsocial

Helpful user defined fuctions / table generating functions for Hive

updated at April 8, 2024, 12:12 a.m.

Java

66 +0

101 +0

46 +0

GitHub
registry by hortonworks

Schema Registry

updated at April 10, 2024, 1:30 p.m.

Java

203 +0

12 +0

7 +0

GitHub
HiBench by Intel-bigdata

HiBench is a big data benchmark suite.

updated at May 1, 2024, 10:35 a.m.

Java

126 +0

1,433 +0

756 +0

GitHub