lakeFS by treeverse

lakeFS - Data version control for your data lake | Git for data

created at Sept. 12, 2019, 11:46 a.m.

Go

40 +0

4,083 +17

329 +0

GitHub
ekuiper by lf-edge

Lightweight data stream processing engine for IoT edge

created at July 3, 2019, 7:37 a.m.

Go

41 +0

1,365 +1

387 +5

GitHub
memdb by rain1017

Distributed Transactional In-Memory Database (全球首个支持分布式事务的MongoDB)

created at Feb. 18, 2015, 7:35 a.m.

JavaScript

44 +0

597 +0

195 +0

GitHub
Akumuli by akumuli

Time-series database

created at Jan. 28, 2014, 9:31 p.m.

C++

44 +0

838 +1

86 +0

GitHub
smart_open by piskvorky

Utils for streaming large files (S3, HDFS, gzip, bz2...)

created at Jan. 2, 2015, 1:05 p.m.

Python

49 +0

3,094 +1

378 +0

GitHub
iondb by iondbproject

IonDB, a key-value datastore for resource constrained systems.

created at Feb. 26, 2015, 12:07 a.m.

C

50 +0

587 +0

48 +0

GitHub
timely by NationalSecurityAgency

Accumulo backed time series database

created at April 12, 2016, 9:33 p.m.

CSS

51 +0

374 +0

110 +0

GitHub
pinball by pinterest

Pinball is a scalable workflow manager

created at March 4, 2015, 3:13 a.m.

JavaScript

54 +0

1,047 +0

143 +0

GitHub
rocker-compose by grammarly

Docker composition tool with idempotency features for deploying apps composed of multiple containers.

created at June 24, 2015, 4:37 p.m.

Go

54 +0

405 +0

26 +0

GitHub
heroic by spotify

The Heroic Time Series Database

created at May 29, 2015, 5:20 a.m.

Java

58 +0

843 +0

109 +0

GitHub
rudder-server by rudderlabs

Privacy and Security focused Segment-alternative, in Golang and React

created at July 19, 2019, 9:24 a.m.

Go

61 +0

3,940 +8

289 +1

GitHub
aws-sdk-pandas by aws

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

created at Feb. 26, 2019, 1:39 a.m.

Python

61 +0

3,805 +3

668 +1

GitHub
PyHive by dropbox

Python interface to Hive and Presto. 🐝

created at Feb. 1, 2014, 9:05 a.m.

Python

62 +0

1,665 +0

552 +0

GitHub
gockerize by redbooth

Package golang service into minimal docker containers.

created at Aug. 4, 2015, 2:02 a.m.

Shell

65 -1

667 +0

20 +0

GitHub
secor by pinterest

Secor is a service implementing Kafka log persistence

created at April 15, 2014, 10:26 p.m.

Java

70 +0

1,835 +0

541 +0

GitHub
mysql_utils by pinterest

Pinterest MySQL Management Tools

created at Oct. 24, 2015, 5:33 p.m.

Python

72 +0

879 +0

141 +0

GitHub
ccm by riptano

A script to easily create and destroy an Apache Cassandra cluster on localhost

created at March 1, 2011, 9:42 a.m.

Python

76 +0

1,212 +0

302 +0

GitHub
kcat by edenhill

Generic command line non-JVM Apache Kafka producer and consumer

created at March 30, 2014, 4:25 a.m.

C

79 +0

5,260 +14

473 +1

GitHub
snappydata by TIBCOSoftware

Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster

created at Sept. 16, 2015, 10:36 a.m.

Scala

84 +0

1,037 +0

203 +0

GitHub
HyperDex by rescrv

HyperDex is a scalable, searchable key-value store

created at Feb. 20, 2012, 11:32 a.m.

C++

88 +0

1,394 +0

168 +0

GitHub