multiwoven by Multiwoven

🔥 Open Source Reverse ETL and Customer Data Platform (CDP). An open-source alternative to tools like Hightouch, Census, and RudderStack.

created at Oct. 20, 2023, 3:21 p.m.

Ruby

12 +0

617 +4

27 +3

GitHub
haproxy_exporter by prometheus

Simple server that scrapes HAProxy stats and exports them via HTTP for Prometheus consumption

created at Jan. 31, 2013, 3:33 p.m.

Go

30 +0

610 +2

219 +0

GitHub
memdb by rain1017

Distributed Transactional In-Memory Database (the world's first MongoDB with distributed transaction support)

created at Feb. 18, 2015, 7:35 a.m.

JavaScript

44 +0

597 +0

195 +0

GitHub
blueflood by rax-maas

A distributed system designed to ingest and process time series data

created at May 15, 2013, 2:50 p.m.

Java

95 +0

592 +0

102 +0

GitHub
iondb by iondbproject

IonDB, a key-value datastore for resource constrained systems.

created at Feb. 26, 2015, 12:07 a.m.

C

50 +0

587 +0

48 +0

GitHub
kafkat by airbnb

KafkaT-ool

created at Aug. 14, 2014, 10:09 p.m.

Ruby

243 +0

504 +1

86 -4

GitHub
eventsim by Interana

Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.

created at Sept. 5, 2014, 4:27 p.m.

Scala

111 +0

486 +1

126 +3

GitHub
zilla by aklivity

🦎 A multi-protocol, event-native proxy. Securely interface web apps, IoT clients, & microservices to Apache Kafka® via declaratively defined, stateless APIs.

created at Dec. 7, 2021, 10:10 p.m.

Java

9 +0

483 +1

47 +1

GitHub
rocker-compose by grammarly

Docker composition tool with idempotency features for deploying apps composed of multiple containers.

created at June 24, 2015, 4:37 p.m.

Go

54 +0

405 +0

26 +0

GitHub
datacompy by capitalone

Pandas and Spark DataFrame comparison for humans and more!

created at March 23, 2018, 1:16 p.m.

Python

25 +0

379 +2

118 +0

GitHub
timely by NationalSecurityAgency

Accumulo-backed time series database

created at April 12, 2016, 9:33 p.m.

CSS

51 +0

374 +0

110 +0

GitHub
delight by datamechanics

A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.

created at Oct. 26, 2020, 1:56 p.m.

Scala

16 +0

334 +0

50 +0

GitHub
incubator-hivemall by apache

Mirror of Apache Hivemall (incubating)

created at Sept. 15, 2016, 7 a.m.

Java

32 +0

310 +0

119 +0

GitHub
kyoto by AlticeLabsProjects

Kyoto Tycoon key-value store (and the underlying Kyoto Cabinet library)

created at Dec. 24, 2014, 5:55 p.m.

C++

29 +0

271 +0

40 +0

GitHub
docker-logstash by pblittle

Docker image for Logstash 1.4

created at Feb. 4, 2014, 1:09 a.m.

Shell

7 +0

237 +0

90 +0

GitHub
deep-spark by Stratio

Connecting Apache Spark with different data stores [DEPRECATED]

created at Feb. 18, 2014, 8:34 a.m.

Java

115 +0

197 +0

42 +0

GitHub
zodiac by CenturyLinkLabs

A lightweight tool for easy deployment and rollback of dockerized applications.

created at May 6, 2015, 6:32 p.m.

Go

22 +0

195 +0

20 +0

GitHub
pg_kafka by xstevens

INACTIVE: A PostgreSQL extension to produce messages to Apache Kafka.

created at Sept. 5, 2014, 6:33 p.m.

C

9 +0

113 +0

15 -1

GitHub
dqo by dqops

Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML files, and let DQOps run them daily to detect data quality issues.

created at March 8, 2022, 3:18 p.m.

Java

5 +0

52 +0

11 +0

GitHub
kafka-logger by uber

A kafka logger for winston

created at Oct. 14, 2014, 10:14 p.m.

JavaScript

2,727 +0

45 +0

11 +0

GitHub