COVID-19 by CSSEGISandData

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE

created at Feb. 4, 2020, 10:03 p.m.

Unknown languages

865 -1

29,101 -4

18,377 -3

GitHub
covid-19-data by NYTimes

A repository of data on coronavirus cases and deaths in the U.S.

created at March 24, 2020, 11:41 p.m.

Unknown languages

318 +0

6,992 +0

3,460 +0

GitHub
countries by mledoze

World countries in JSON, CSV, XML and Yaml. Any help is welcome!

created at Jan. 6, 2012, 2:01 p.m.

PHP

158 +0

5,990 +1

1,273 +0

GitHub
country-list by umpirsky

globe with meridians List of all countries with names and ISO 3166-1 codes in all languages and data formats.

created at March 2, 2012, 11:23 a.m.

HTML

154 +0

5,178 +4

1,553 +0

GitHub
List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words by LDNOOBW

List of Dirty, Naughty, Obscene, and Otherwise Bad Words

created at March 9, 2012, 2:15 a.m.

Unknown languages

74 +0

2,962 +6

664 -1

GitHub
fma by mdeff

FMA: A Dataset For Music Analysis

created at Dec. 2, 2016, 3:27 p.m.

Jupyter Notebook

58 +0

2,267 +2

444 +0

GitHub
bruteforce-database by duyetdev

Bruteforce database

created at Oct. 4, 2015, 2:55 p.m.

Unknown languages

72 +0

1,473 +4

575 +1

GitHub
geo-maps by simonepri

đź—ş High Quality GeoJSON maps programmatically generated.

created at Sept. 2, 2017, 12:42 p.m.

JavaScript

25 +0

1,280 +1

66 +0

GitHub
tennis_atp by JeffSackmann

ATP Tennis Rankings, Results, and Stats

created at March 13, 2015, 12:15 p.m.

Unknown languages

100 +0

1,038 +3

612 -1

GitHub
domains by tb0hdan

World’s single largest Internet domains dataset

created at Jan. 12, 2020, 10:39 p.m.

HTML

31 +0

722 +3

112 +1

GitHub
uber-tlc-foil-response by fivethirtyeight

Uber trip data from a freedom of information request to NYC's Taxi & Limousine Commission

created at Aug. 28, 2015, 6:38 p.m.

Unknown languages

69 +0

713 +0

374 +0

GitHub
collection by tategallery

Tate Collection metadata

created at Sept. 16, 2013, 4:05 p.m.

Python

60 +0

513 +0

187 +0

GitHub
pudl by catalyst-cooperative

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.

created at Feb. 1, 2017, 5:45 p.m.

Python

19 +0

492 +2

117 +1

GitHub
twofishes by foursquare

MOVED - The project is still under development but this page is deprecated.

created at Feb. 17, 2012, 11:58 p.m.

Scala

201 +0

434 +0

62 +0

GitHub
open-traffic-collection by graphhopper

Collection of open data resources for traffic information

created at April 13, 2015, 11:39 a.m.

Unknown languages

28 +0

408 +1

48 +1

GitHub
medal by McGill-NLP

Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain

created at April 22, 2020, 3:13 a.m.

Python

11 +0

250 +0

41 +0

GitHub
transfermarkt-datasets by dcaribou

⚽️ Extract, prepare and publish Transfermarkt datasets.

created at Dec. 26, 2020, 5:33 p.m.

Python

10 +0

248 +1

57 +1

GitHub
awesome-citygml by OloOcki

The ultimate list of open data semantic 3D city models

created at Jan. 7, 2021, 4:48 p.m.

Unknown languages

10 +1

244 +4

30 +0

GitHub
collection by cooperhewitt

Collection Data for Cooper Hewitt, Smithsonian Design Museum

created at Feb. 14, 2012, 7:18 p.m.

Unknown languages

38 +0

231 +0

46 +0

GitHub
tennis_wta by JeffSackmann

WTA Tennis Rankings, Results, and Stats

created at March 13, 2015, 12:21 p.m.

Unknown languages

30 +0

229 +0

150 +0

GitHub