⚽️ Extract, prepare and publish Transfermarkt datasets.
created at Dec. 26, 2020, 5:33 p.m.
Cube++ is a novel dataset collected for the illumination estimation problem. It has 4890 raw 18-megapixel images, each containing a SpyderCube color target in the scene, along with manually labelled categories and ground-truth illumination chromaticities.
created at July 21, 2020, 1 p.m.
A repo containing various data (demographics, employment, etc.) in JSON form.
created at June 23, 2020, 3:20 a.m.
A large medical text dataset curated for abbreviation disambiguation and designed for natural language understanding pre-training in the medical domain.
created at April 22, 2020, 3:13 a.m.
The Turing Change Point Dataset: a collection of time series for the evaluation and development of change point detection algorithms.
created at Nov. 28, 2019, 4:07 p.m.
A comprehensive, accessible database that contains records of over 260k US gun violence incidents from January 2013 to March 2018.
created at April 2, 2018, 6:54 p.m.
The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.
created at Feb. 1, 2017, 5:45 p.m.
An air travel dataset consisting of user reviews from Skytrax (www.airlinequality.com)
created at Aug. 11, 2015, 11:28 a.m.