pace by getstrm

Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or plain 'ol Postgres, even!) with definitions imported from Collibra, Datahub, ODD and the like.

created at Oct. 18, 2023, 12:49 p.m.

Kotlin

3 +0

31 +0

0 +0

GitHub
bistro by asavinov

A general-purpose data analysis engine radically changing the way batch and stream data is processed

created at Nov. 9, 2017, 3:42 p.m.

Java

2 +0

7 +0

0 +0

GitHub
micro-s3-persistence by shinymayhem

Docker microservice for saving/restoring volume data to S3

created at March 21, 2015, 10:05 p.m.

JavaScript

3 +0

13 +0

1 +0

GitHub
snackfs-release by tuplejump

The GA Release of SnackFS

created at March 19, 2014, 6:17 a.m.

Scala

7 +0

13 +0

5 +0

GitHub
dqo by dqops

Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML files, let DQOps run the data quality checks daily to detect data quality issues.

created at March 8, 2022, 3:18 p.m.

Java

5 +0

52 +0

11 +0

GitHub
kafka-logger by uber

A kafka logger for winston

created at Oct. 14, 2014, 10:14 p.m.

JavaScript

2,727 +0

45 +0

11 +0

GitHub
pg_kafka by xstevens

INACTIVE: A PostgreSQL extension to produce messages to Apache Kafka.

created at Sept. 5, 2014, 6:33 p.m.

C

9 +0

113 +0

15 -1

GitHub
zodiac by CenturyLinkLabs

A lightweight tool for easy deployment and rollback of dockerized applications.

created at May 6, 2015, 6:32 p.m.

Go

22 +0

195 +0

20 +0

GitHub
gockerize by redbooth

Package golang service into minimal docker containers.

created at Aug. 4, 2015, 2:02 a.m.

Shell

66 +0

667 +0

20 +0

GitHub
rocker-compose by grammarly

Docker composition tool with idempotency features for deploying apps composed of multiple containers.

created at June 24, 2015, 4:37 p.m.

Go

54 +0

405 +0

26 +0

GitHub
multiwoven by Multiwoven

🔥 Open Source Reverse ETL and Customer Data Platform (CDP). An open-source alternative to tools like Hightouch, Census, and RudderStack.

created at Oct. 20, 2023, 3:21 p.m.

Ruby

12 +0

617 +4

27 +3

GitHub
kyoto by AlticeLabsProjects

Kyoto Tycoon key-value store (and the underlying Kyoto Cabinet library)

created at Dec. 24, 2014, 5:55 p.m.

C++

29 +0

271 +0

40 +0

GitHub
deep-spark by Stratio

Connecting Apache Spark with different data stores [DEPRECATED]

created at Feb. 18, 2014, 8:34 a.m.

Java

115 +0

197 +0

42 +0

GitHub
dalmatinerdb by dalmatinerdb

See gitlab: https://gitlab.com/Project-FiFo/DalmatinerDB/dalmatinerdb

created at June 13, 2014, 7:08 p.m.

Erlang

37 +0

697 +0

44 +0

GitHub
zilla by aklivity

🦎 A multi-protocol, event-native proxy. Securely interface web apps, IoT clients, & microservices to Apache Kafka® via declaratively defined, stateless APIs.

created at Dec. 7, 2021, 10:10 p.m.

Java

9 +0

483 +1

47 +1

GitHub
iondb by iondbproject

IonDB, a key-value datastore for resource constrained systems.

created at Feb. 26, 2015, 12:07 a.m.

C

50 +0

587 +0

48 +0

GitHub
delight by datamechanics

A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.

created at Oct. 26, 2020, 1:56 p.m.

Scala

16 +0

334 +0

50 +0

GitHub
hstream by hstreamdb

HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.

created at Aug. 31, 2020, 9:42 a.m.

Haskell

23 +0

691 +0

56 +0

GitHub
kafkat by airbnb

KafkaT-ool

created at Aug. 14, 2014, 10:09 p.m.

Ruby

243 +0

504 +1

86 -4

GitHub
Akumuli by akumuli

Time-series database

created at Jan. 28, 2014, 9:31 p.m.

C++

44 +0

837 +0

86 +0

GitHub