Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain
updated at April 28, 2024, 4:27 p.m.
The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.
updated at April 27, 2024, 3:31 p.m.
⚽️ Extract, prepare and publish Transfermarkt datasets.
updated at April 27, 2024, 12:21 a.m.
The Turing Change Point Dataset - A collection of time series for the evaluation and development of change point detection algorithms
updated at April 19, 2024, 11:45 a.m.
An air travel dataset consisting of user reviews from Skytrax (www.airlinequality.com)
updated at April 2, 2024, 5:40 p.m.
A repo containing various data (demographics, employment, etc.) in JSON form.
updated at March 29, 2024, 10 a.m.
Cube++ is a novel dataset collected for the illumination estimation problem. It has 4890 raw 18-megapixel images, each containing a SpyderCube color target in its scene, along with manually labelled categories and ground truth illumination chromaticities.
updated at March 15, 2024, 6:44 a.m.
A comprehensive, accessible database that contains records of over 260k US gun violence incidents from January 2013 to March 2018.
updated at Dec. 14, 2023, 4:48 p.m.