Oryx 2: Lambda architecture on Apache Spark and Apache Kafka for real-time, large-scale machine learning
created at July 25, 2014, 8:08 p.m.
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
created at March 25, 2010, 1:49 a.m.
Apache Flume source plugin allowing direct consumption of UDP messages
created at March 18, 2014, 11:32 p.m.
Unit test framework for Hive and hive-service
created at Sept. 16, 2011, 2:39 p.m.
Working commits for Hive connector to Accumulo. This will eventually be checked directly into Accumulo.
created at March 2, 2013, 10:57 a.m.
A Hive StorageHandler that uses a Google Spreadsheet as a backend.
created at Aug. 19, 2011, 2:40 a.m.
Hive storage handler for connecting with MongoDB
created at Nov. 17, 2011, 7:24 a.m.
Hive Storage Handler for Cassandra (cloned from https://github.com/riptano/hive/tree/hive-0.8.1-merge/cassandra-handler)
created at April 12, 2013, 7:33 p.m.
Reusable code for Hive
created at April 6, 2011, 1:45 a.m.
Helpful user-defined functions / table-generating functions for Hive
created at April 5, 2011, 5:46 p.m.
User Defined Functions for Hive to work with Cassandra
created at Dec. 29, 2011, 3:24 p.m.