An orchestration platform for the development, production, and observation of data assets.
updated at May 5, 2024, 4:32 a.m.
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
updated at May 4, 2024, 9:05 p.m.
Pandas and Spark DataFrame comparison for humans and more!
updated at May 3, 2024, 8:43 a.m.
Utils for streaming large files (S3, HDFS, gzip, bz2...)
updated at May 2, 2024, 12:46 p.m.
What's in your data? Extract schema, statistics and entities from datasets
updated at May 2, 2024, 2:22 a.m.