COVID-19 by CSSEGISandData

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE

created at Feb. 4, 2020, 10:03 p.m.

Unknown languages

866 +0

29,107 -6

18,389 -3

GitHub
covid-19-data by NYTimes

A repository of data on coronavirus cases and deaths in the U.S.

created at March 24, 2020, 11:41 p.m.

Unknown languages

318 +0

6,991 +0

3,461 +0

GitHub
countries by mledoze

World countries in JSON, CSV, XML and Yaml. Any help is welcome!

created at Jan. 6, 2012, 2:01 p.m.

PHP

158 -1

5,981 +4

1,272 +0

GitHub
country-list by umpirsky

🌐 List of all countries with names and ISO 3166-1 codes in all languages and data formats.

created at March 2, 2012, 11:23 a.m.

HTML

154 +0

5,175 +3

1,554 +1

GitHub
List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words by LDNOOBW

List of Dirty, Naughty, Obscene, and Otherwise Bad Words

created at March 9, 2012, 2:15 a.m.

Unknown languages

73 +0

2,937 +1

666 +0

GitHub
fma by mdeff

FMA: A Dataset For Music Analysis

created at Dec. 2, 2016, 3:27 p.m.

Jupyter Notebook

58 +0

2,247 +6

440 +1

GitHub
bruteforce-database by duyetdev

Bruteforce database

created at Oct. 4, 2015, 2:55 p.m.

Unknown languages

72 +0

1,456 +7

571 +4

GitHub
geo-maps by simonepri

🗺 High Quality GeoJSON maps programmatically generated.

created at Sept. 2, 2017, 12:42 p.m.

JavaScript

25 +0

1,275 +0

66 +0

GitHub
tennis_atp by JeffSackmann

ATP Tennis Rankings, Results, and Stats

created at March 13, 2015, 12:15 p.m.

Unknown languages

100 +0

1,025 +1

610 +2

GitHub
uber-tlc-foil-response by fivethirtyeight

Uber trip data from a freedom of information request to NYC's Taxi & Limousine Commission

created at Aug. 28, 2015, 6:38 p.m.

Unknown languages

69 +0

713 +0

374 +0

GitHub
domains by tb0hdan

World's single largest Internet domains dataset

created at Jan. 12, 2020, 10:39 p.m.

HTML

30 +0

712 +7

109 +1

GitHub
collection by tategallery

Tate Collection metadata

created at Sept. 16, 2013, 4:05 p.m.

Python

60 +0

511 +0

187 +0

GitHub
pudl by catalyst-cooperative

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.

created at Feb. 1, 2017, 5:45 p.m.

Python

19 +0

482 +4

110 +0

GitHub
twofishes by foursquare

MOVED - The project is still under development but this page is deprecated.

created at Feb. 17, 2012, 11:58 p.m.

Scala

201 +0

434 +0

62 +0

GitHub
open-traffic-collection by graphhopper

Collection of open data resources for traffic information

created at April 13, 2015, 11:39 a.m.

Unknown languages

28 +0

404 +1

47 -1

GitHub
transfermarkt-datasets by dcaribou

⚽️ Extract, prepare and publish Transfermarkt datasets.

created at Dec. 26, 2020, 5:33 p.m.

Python

10 +0

247 +4

57 +1

GitHub
medal by McGill-NLP

Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain

created at April 22, 2020, 3:13 a.m.

Python

11 +0

245 +1

40 +0

GitHub
awesome-citygml by OloOcki

The ultimate list of open data semantic 3D city models

created at Jan. 7, 2021, 4:48 p.m.

Unknown languages

9 +0

235 +2

29 +0

GitHub
collection by cooperhewitt

Collection Data for Cooper Hewitt, Smithsonian Design Museum

created at Feb. 14, 2012, 7:18 p.m.

Unknown languages

38 +0

229 +0

46 +0

GitHub
tennis_wta by JeffSackmann

WTA Tennis Rankings, Results, and Stats

created at March 13, 2015, 12:21 p.m.

Unknown languages

30 +0

227 +0

148 +2

GitHub