awesome-citygml by OloOcki

The ultimate list of open data semantic 3D city models

created at Jan. 7, 2021, 4:48 p.m.

Unknown languages

7 +0

178 +19

23 +2

GitHub
transfermarkt-datasets by dcaribou

⚽️ Extract, prepare and publish Transfermarkt datasets.

created at Dec. 26, 2020, 5:33 p.m.

Python

7 +1

161 +15

44 +4

GitHub
B3FD by kbesenic

Biometrically Filtered Famous Figure Dataset

created at Nov. 22, 2020, 7:36 p.m.

Unknown languages

1 +0

5 +0

0 +0

GitHub
shopper-intent-prediction-nature-2020 by coveooss

🏟

created at Oct. 29, 2020, 1:52 p.m.

Unknown languages

6 +0

24 +0

5 +0

GitHub
data-2C-beyond-the-limit-usa by washingtonpost

The Washington Post's analysis of NOAA climate change data for the contiguous United States

created at Aug. 7, 2020, 12:59 p.m.

HTML

4 +0

59 +1

20 +1

GitHub
lemon-dataset by softwaremill

Lemons quality control dataset

created at July 28, 2020, 6:42 a.m.

Unknown languages

5 +0

97 +1

12 +0

GitHub
CubePlusPlus by Visillect

Cube++ is a novel dataset collected for illumination estimation problem. It has 4890 raw 18-megapixel images, each containing a SpyderCube color target in their scenes, manually labelled categories, and ground truth illumination chromaticities.

created at July 21, 2020, 1 p.m.

Python

13 +0

49 +1

5 +0

GitHub
MORED by MOREDataset

A Moroccan Buildings’ Electricity Consumption Dataset. MORED is made available by TICLab of the International University of Rabat (UIR), and the data collection was carried out as part of PVBuild research project, coordinated by Prof. Mounir Ghogho and funded by the United States Agency for International Development (USAID).

created at July 13, 2020, 10:48 a.m.

Unknown languages

1 +0

9 +0

2 +0

GitHub
JsonOfCounties by evangambit

A repo containing various data (demographics, employment, etc.) in JSON form.

created at June 23, 2020, 3:20 a.m.

Python

7 +0

56 +0

10 +0

GitHub
medal by McGill-NLP

Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain

created at April 22, 2020, 3:13 a.m.

Python

11 +0

199 +8

35 +1

GitHub
covid-19-data by yahoo

COVID-19 datasets are constructed entirely from primary (government and public agency) sources

created at March 30, 2020, 11:05 p.m.

Unknown languages

31 +0

109 +1

25 +0

GitHub
ecuacovid by andrab

Datos sin procesar extraído, limpiado, y normalizado de los informes de la situación nacional frente a la Emergencia Sanitaria SARS-CoV2 (COVID-19) de SNGRE, MSP, Registro Civil, e INEC.

created at March 29, 2020, 7:57 a.m.

Ruby

10 +0

78 +0

58 +0

GitHub
dbfc-dataset by ECSIM

Single DBFC Dataset

created at March 27, 2020, 7:41 p.m.

Jupyter Notebook

4 +0

20 +0

3 +0

GitHub
covid-19-data by NYTimes

A repository of data on coronavirus cases and deaths in the U.S.

created at March 24, 2020, 11:41 p.m.

Unknown languages

318 +1

6,990 +4

3,475 -16

GitHub
COVID-19 by CSSEGISandData

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE

created at Feb. 4, 2020, 10:03 p.m.

Unknown languages

865 -3

29,188 -18

18,519 -125

GitHub
domains by tb0hdan

World’s single largest Internet domains dataset

created at Jan. 12, 2020, 10:39 p.m.

HTML

29 +1

628 +21

103 +2

GitHub
pem-dataset1 by ECSIM

Proton Exchange Membrane (PEM) Fuel Cell Dataset

created at Jan. 4, 2020, 8:57 a.m.

Jupyter Notebook

6 +0

75 +1

23 +0

GitHub
TCPD by alan-turing-institute

The Turing Change Point Dataset - A collection of time series for the evaluation and development of change point detection algorithms

created at Nov. 28, 2019, 4:07 p.m.

Python

8 +0

128 +5

27 +3

GitHub
Pro-Kabadi-season-1-7-Stats by ranganadhkodali

This Repo contain both Python Code (unorganized) and Data Used for Downloading Stats Data from Pro Kabadi.

created at Sept. 25, 2019, 5:06 p.m.

Jupyter Notebook

2 +0

2 +0

6 +0

GitHub
All-Age-Faces-Dataset by JingchunCheng

All-Age-Faces (AAF) Database.

created at Feb. 26, 2019, 12:33 p.m.

Unknown languages

5 +0

169 +2

16 +0

GitHub