bistro by asavinov

A general-purpose data analysis engine radically changing the way batch and stream data is processed

created at Nov. 9, 2017, 3:42 p.m.

Java

2 +0

7 +0

0 +0

GitHub
snackfs-release by tuplejump

The GA Release of SnackFS

created at March 19, 2014, 6:17 a.m.

Scala

7 +0

13 +0

5 +0

GitHub
micro-s3-persistence by shinymayhem

Docker microservice for saving/restoring volume data to S3

created at March 21, 2015, 10:05 p.m.

JavaScript

3 +0

13 +0

1 +0

GitHub
pace by getstrm

Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or plain 'ol Postgres, even!) with definitions imported from Collibra, Datahub, ODD and the like.

created at Oct. 18, 2023, 12:49 p.m.

Kotlin

3 +0

31 +0

0 +0

GitHub
kafka-logger by uber

A kafka logger for winston

created at Oct. 14, 2014, 10:14 p.m.

JavaScript

2,727 +0

45 +0

11 +0

GitHub
dqo by dqops

Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML files, let DQOps run the data quality checks daily to detect data quality issues.

created at March 8, 2022, 3:18 p.m.

Java

5 +0

52 +0

11 +0

GitHub
pg_kafka by xstevens

INACTIVE: A PostgreSQL extension to produce messages to Apache Kafka.

created at Sept. 5, 2014, 6:33 p.m.

C

9 +0

113 +0

15 -1

GitHub
zodiac by CenturyLinkLabs

A lightweight tool for easy deployment and rollback of dockerized applications.

created at May 6, 2015, 6:32 p.m.

Go

22 +0

195 +0

20 +0

GitHub
deep-spark by Stratio

Connecting Apache Spark with different data stores [DEPRECATED]

created at Feb. 18, 2014, 8:34 a.m.

Java

115 +0

197 +0

42 +0

GitHub
docker-logstash by pblittle

Docker image for Logstash 1.4

created at Feb. 4, 2014, 1:09 a.m.

Shell

7 +0

237 +0

90 +0

GitHub
kyoto by AlticeLabsProjects

Kyoto Tycoon key-value store (and the underlying Kyoto Cabinet library)

created at Dec. 24, 2014, 5:55 p.m.

C++

29 +0

271 +0

40 +0

GitHub
incubator-hivemall by apache

Mirror of Apache Hivemall (incubating)

created at Sept. 15, 2016, 7 a.m.

Java

32 +0

310 +0

119 +0

GitHub
delight by datamechanics

A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.

created at Oct. 26, 2020, 1:56 p.m.

Scala

16 +0

334 +0

50 +0

GitHub
timely by NationalSecurityAgency

Accumulo backed time series database

created at April 12, 2016, 9:33 p.m.

CSS

51 +0

374 +0

110 +0

GitHub
datacompy by capitalone

Pandas and Spark DataFrame comparison for humans and more!

created at March 23, 2018, 1:16 p.m.

Python

25 +0

379 +2

118 +0

GitHub
rocker-compose by grammarly

Docker composition tool with idempotency features for deploying apps composed of multiple containers.

created at June 24, 2015, 4:37 p.m.

Go

54 +0

405 +0

26 +0

GitHub
zilla by aklivity

🦎 A multi-protocol, event-native proxy. Securely interface web apps, IoT clients, & microservices to Apache Kafka® via declaratively defined, stateless APIs.

created at Dec. 7, 2021, 10:10 p.m.

Java

9 +0

483 +1

47 +1

GitHub
eventsim by Interana

Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.

created at Sept. 5, 2014, 4:27 p.m.

Scala

111 +0

486 +1

126 +3

GitHub
kafkat by airbnb

KafkaT-ool

created at Aug. 14, 2014, 10:09 p.m.

Ruby

243 +0

504 +1

86 -4

GitHub
iondb by iondbproject

IonDB, a key-value datastore for resource constrained systems.

created at Feb. 26, 2015, 12:07 a.m.

C

50 +0

587 +0

48 +0

GitHub