This data set includes Landsat 8 images and their manually extracted pixel-level ground truths for cloud detection.
updated at April 12, 2024, 8:25 a.m.
All-Age-Faces (AAF) Database.
updated at April 26, 2024, 6:52 a.m.
The Washington Post's analysis of NOAA climate change data for the contiguous United States
updated at April 29, 2024, 8:08 a.m.
MOVED - The project is still under development but this page is deprecated.
updated at May 3, 2024, 9:37 p.m.
Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain
updated at May 4, 2024, 4:36 a.m.
Datasets of the daily Twitter output of Congress.
updated at May 5, 2024, 4:24 a.m.
Uber trip data from a freedom of information request to NYC's Taxi & Limousine Commission
updated at May 6, 2024, 1:50 a.m.
The first realistic and public dataset with rare undesirable real events in oil wells.
updated at May 8, 2024, 9:01 a.m.
American Gut open-access data and IPython notebooks
updated at May 9, 2024, 3:30 a.m.
List of all countries with names and ISO 3166-1 codes in all languages and data formats.
updated at May 9, 2024, 5:08 a.m.
The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.
updated at May 9, 2024, 4:59 p.m.
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
updated at May 9, 2024, 11:18 p.m.
The Turing Change Point Dataset - A collection of time series for the evaluation and development of change point detection algorithms
updated at May 10, 2024, 4:42 a.m.
The ultimate list of open data semantic 3D city models
updated at May 10, 2024, 8:20 a.m.