An orchestration platform for the development, production, and observation of data assets.
updated at May 19, 2024, 11:30 a.m.
Pandas and Spark DataFrame comparison for humans and more!
updated at May 19, 2024, 5:17 a.m.
What's in your data? Extract schema, statistics and entities from datasets
updated at May 18, 2024, 3:05 p.m.
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
updated at May 18, 2024, 12:51 p.m.
Utils for streaming large files (S3, HDFS, gzip, bz2...)
updated at May 16, 2024, 2:13 a.m.