Collection Data for Cooper Hewitt, Smithsonian Design Museum
created at Feb. 14, 2012, 7:18 p.m.
An air travel dataset consisting of user reviews from Skytrax (www.airlinequality.com)
created at Aug. 11, 2015, 11:28 a.m.
Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain
created at April 22, 2020, 3:13 a.m.
Datasets of the daily Twitter output of Congress.
created at May 3, 2017, 10:47 p.m.
This data set includes Landsat 8 images and their manually extracted pixel-level ground truths for cloud detection.
created at Feb. 6, 2019, 12:11 a.m.
A global registry of JSON formatted files on 1500+ cryptocurrency tokens. Provides information like chat rooms, communities, explorers, and contact information on each coin. Used by https://blockmodo.com, DEXs, developers, and exchanges.
created at June 20, 2018, 6:15 a.m.
The ultimate list of open data semantic 3D city models
created at Jan. 7, 2021, 4:48 p.m.
The Turing Change Point Dataset - A collection of time series for the evaluation and development of change point detection algorithms
created at Nov. 28, 2019, 4:07 p.m.
COVID-19 datasets are constructed entirely from primary (government and public agency) sources
created at March 30, 2020, 11:05 p.m.
Proton Exchange Membrane (PEM) Fuel Cell Dataset
created at Jan. 4, 2020, 8:57 a.m.
The Washington Post's analysis of NOAA climate change data for the contiguous United States
created at Aug. 7, 2020, 12:59 p.m.
All-Age-Faces (AAF) Database.
created at Feb. 26, 2019, 12:33 p.m.
This repo contains a set of Arabic newspaper articles alongwith metadata, extracted from various Saudi newspapers.
created at July 21, 2015, 8:40 p.m.
The tracebase appliance-level power consumption data set
created at Nov. 17, 2017, 12:09 p.m.
A repo containing various data (demographics, employment, etc.) in JSON form.
created at June 23, 2020, 3:20 a.m.