pem-dataset1 by ECSIM

Proton Exchange Membrane (PEM) Fuel Cell Dataset

created at Jan. 4, 2020, 8:57 a.m.

Jupyter Notebook

7 +0

86 +1

24 +0

GitHub
lemon-dataset by softwaremill

Lemons quality control dataset

created at July 28, 2020, 6:42 a.m.

Unknown languages

5 +0

102 -1

13 +0

GitHub
congresstweets by alexlitel

Datasets of the daily Twitter output of Congress.

created at May 3, 2017, 10:47 p.m.

SCSS

7 +0

105 +0

39 +0

GitHub
covid-19-data by yahoo

COVID-19 datasets are constructed entirely from primary (government and public agency) sources

created at March 30, 2020, 11:05 p.m.

Unknown languages

31 +0

110 +0

25 +0

GitHub
3w_dataset by ricardovvargas

The first realistic and public dataset with rare undesirable real events in oil wells.

created at Jan. 19, 2019, 12:30 a.m.

Jupyter Notebook

14 +0

110 +0

58 +0

GitHub
coin_registry by Blockmodo

A global registry of JSON formatted files on 1500+ cryptocurrency tokens. Provides information like chat rooms, communities, explorers, and contact information on each coin. Used by https://blockmodo.com, DEXs, developers, and exchanges.

created at June 20, 2018, 6:15 a.m.

Unknown languages

8 +0

112 +0

32 +0

GitHub
American-Gut by biocore

American Gut open-access data and IPython notebooks

created at Oct. 1, 2013, 11:39 p.m.

Jupyter Notebook

32 +0

114 +0

81 +0

GitHub
TCPD by alan-turing-institute

The Turing Change Point Dataset - A collection of time series for the evaluation and development of change point detection algorithms

created at Nov. 28, 2019, 4:07 p.m.

Python

8 +0

136 +1

27 +0

GitHub
38-Cloud-A-Cloud-Segmentation-Dataset by SorourMo

This data set includes Landsat 8 images and their manually extracted pixel-level ground truths for cloud detection.

created at Feb. 6, 2019, 12:11 a.m.

MATLAB

6 +0

150 +0

36 +0

GitHub
open-data by freeCodeCamp

None

created at Nov. 25, 2015, 10:15 p.m.

HTML

16 +0

157 +0

39 +0

GitHub
All-Age-Faces-Dataset by JingchunCheng

All-Age-Faces (AAF) Database.

created at Feb. 26, 2019, 12:33 p.m.

Unknown languages

4 +0

181 +1

17 +0

GitHub
tennis_wta by JeffSackmann

WTA Tennis Rankings, Results, and Stats

created at March 13, 2015, 12:21 p.m.

Unknown languages

30 +0

227 +0

148 +2

GitHub
collection by cooperhewitt

Collection Data for Cooper Hewitt, Smithsonian Design Museum

created at Feb. 14, 2012, 7:18 p.m.

Unknown languages

38 +0

229 +0

46 +0

GitHub
awesome-citygml by OloOcki

The ultimate list of open data semantic 3D city models

created at Jan. 7, 2021, 4:48 p.m.

Unknown languages

9 +0

235 +2

29 +0

GitHub
medal by McGill-NLP

Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain

created at April 22, 2020, 3:13 a.m.

Python

11 +0

245 +1

40 +0

GitHub
transfermarkt-datasets by dcaribou

⚽️ Extract, prepare and publish Transfermarkt datasets.

created at Dec. 26, 2020, 5:33 p.m.

Python

10 +0

247 +4

57 +1

GitHub
open-traffic-collection by graphhopper

Collection of open data resources for traffic information

created at April 13, 2015, 11:39 a.m.

Unknown languages

28 +0

404 +1

47 -1

GitHub
twofishes by foursquare

MOVED - The project is still under development but this page is deprecated.

created at Feb. 17, 2012, 11:58 p.m.

Scala

201 +0

434 +0

62 +0

GitHub
pudl by catalyst-cooperative

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.

created at Feb. 1, 2017, 5:45 p.m.

Python

19 +0

482 +4

110 +0

GitHub
collection by tategallery

Tate Collection metadata

created at Sept. 16, 2013, 4:05 p.m.

Python

60 +0

511 +0

187 +0

GitHub