hive_cassandra_udfs by edwardcapriolo

User-defined functions for Hive to work with Cassandra

updated at Oct. 15, 2015, 1:59 a.m.

Java

4 +0

11 +0

5 +0

GitHub
accumulo-hive-storage-manager by bfemiano

Working commits for Hive connector to Accumulo. This will eventually be checked directly into Accumulo.

updated at Nov. 19, 2020, 9:41 p.m.

Java

6 +0

13 +0

12 +0

GitHub
Hive-Cassandra by dvasilen

Hive Storage Handler for Cassandra (cloned from https://github.com/riptano/hive/tree/hive-0.8.1-merge/cassandra-handler)

updated at Jan. 28, 2022, 4:46 p.m.

Java

9 +0

15 +0

14 +0

GitHub
gdata-storagehandler by balshor

A Hive StorageHandler that uses a Google Spreadsheet as a backend.

updated at Feb. 11, 2022, 5:07 a.m.

Java

3 +0

14 +0

4 +0

GitHub
Hive-Extensions-from-Think-Big-Analytics by ThinkBigAnalytics

Reusable code for Hive

updated at April 1, 2022, 2 a.m.

Java

316 +0

16 +0

14 +0

GitHub
flume-udp-source by whitepages

Apache Flume source plugin allowing direct consumption of UDP messages

updated at Jan. 28, 2023, 9:42 a.m.

Java

4 +0

8 +0

9 +0

GitHub
ls-hive by lovelysystems

Lovely Systems Hive Goodies

updated at Jan. 28, 2023, 7:23 p.m.

Java

16 +0

5 +0

2 +0

GitHub
flume-ng-rabbitmq by jcustenborder

Flume plugin for RabbitMQ

updated at Aug. 18, 2023, 12:35 a.m.

Java

10 +0

59 +0

46 +0

GitHub
akela by mozilla-metrics

A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.

updated at Oct. 6, 2023, 4:55 p.m.

Java

23 +0

76 +0

31 +0

GitHub
varaha by thedatachef

Machine learning and natural language processing with Apache Pig

updated at Oct. 14, 2023, 8:49 a.m.

Java

9 +0

53 +0

15 +0

GitHub
mpich2-yarn by alibaba

Running MPICH2 on YARN

updated at Dec. 19, 2023, 7:30 a.m.

Java

34 +0

114 +0

62 +0

GitHub
elephant-bird by twitter

Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.

updated at March 31, 2024, 2:11 p.m.

Java

189 +0

1,137 +0

390 +0

GitHub
flume-ng-mongodb-sink by leonlee

Flume NG MongoDB sink.

updated at March 31, 2024, 2:13 p.m.

Java

13 +0

71 +0

62 +0

GitHub
white-elephant by LinkedInAttic

Hadoop log aggregator and dashboard

updated at March 31, 2024, 2:13 p.m.

Java

97 +0

190 +0

63 +0

GitHub
ankush by Impetus

A big data cluster management tool that creates and manages clusters of different technologies.

updated at April 2, 2024, 5:41 p.m.

Java

13 +0

21 +0

17 +0

GitHub
haeinsa by VCNC

Haeinsa is a linearly scalable multi-row, multi-table transaction library for HBase

updated at April 6, 2024, 6:01 a.m.

Java

30 +0

158 +0

47 +0

GitHub
registry by hortonworks

Schema Registry

updated at April 10, 2024, 1:30 p.m.

Java

202 -1

12 +0

7 +0

GitHub
oryx by OryxProject

Oryx 2: Lambda architecture on Apache Spark and Apache Kafka for real-time, large-scale machine learning

updated at May 6, 2024, 10:14 a.m.

Java

209 +0

1,789 +0

405 +0

GitHub
Hive-mongo by yc-huang

Hive storage handler for connecting to MongoDB

updated at May 6, 2024, 3:14 p.m.

Java

10 +0

32 +0

33 +0

GitHub
hive-solr by chimpler

Hive Storage Handler for Solr

updated at May 6, 2024, 3:14 p.m.

Java

10 +0

16 +0

26 +0

GitHub