open-traffic-collection by graphhopper

Collection of open data resources for traffic information

updated at April 28, 2024, 6:12 p.m.

Unknown languages

26 +0

376 +1

49 +0

GitHub
medal by McGill-NLP

Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain

updated at April 28, 2024, 4:27 p.m.

Python

10 +0

208 +4

36 +0

GitHub
bruteforce-database by duyetdev

Bruteforce database

updated at April 28, 2024, 11:44 a.m.

Unknown languages

70 +0

1,378 +3

558 +0

GitHub
uber-tlc-foil-response by fivethirtyeight

Uber trip data from a freedom of information request to NYC's Taxi & Limousine Commission

updated at April 28, 2024, 5:36 a.m.

Unknown languages

70 +0

708 +2

374 +0

GitHub
fma by mdeff

FMA: A Dataset For Music Analysis

updated at April 28, 2024, 5:20 a.m.

Jupyter Notebook

57 -1

2,125 +5

425 +0

GitHub
COVID-19 by CSSEGISandData

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE

updated at April 27, 2024, 9:59 p.m.

Unknown languages

865 +0

29,173 -6

18,498 +1

GitHub
collection by artsmia

Mia collection metadata

updated at April 27, 2024, 7:33 p.m.

Unknown languages

13 +0

72 +0

10 +0

GitHub
domains by tb0hdan

World’s single largest Internet domains dataset

updated at April 27, 2024, 4:38 p.m.

HTML

29 +0

639 +4

103 +0

GitHub
pudl by catalyst-cooperative

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.

updated at April 27, 2024, 3:31 p.m.

Python

18 +0

443 +2

101 +4

GitHub
tennis_wta by JeffSackmann

WTA Tennis Rankings, Results, and Stats

updated at April 27, 2024, 1:48 p.m.

Unknown languages

29 +0

208 +1

140 +0

GitHub
transfermarkt-datasets by dcaribou

⚽️ Extract, prepare and publish Transfermarkt datasets.

updated at April 27, 2024, 12:21 a.m.

Python

9 +0

174 +3

45 +0

GitHub
covid-19-data by NYTimes

A repository of data on coronavirus cases and deaths in the U.S.

updated at April 26, 2024, 9:43 p.m.

Unknown languages

318 +0

6,986 +0

3,473 -1

GitHub
geo-maps by simonepri

🗺 High Quality GeoJSON maps programmatically generated.

updated at April 26, 2024, 12:41 p.m.

JavaScript

25 +0

1,232 +2

65 +1

GitHub
All-Age-Faces-Dataset by JingchunCheng

All-Age-Faces (AAF) Database.

updated at April 26, 2024, 6:52 a.m.

Unknown languages

4 +0

173 +1

16 +0

GitHub
List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words by LDNOOBW

List of Dirty, Naughty, Obscene, and Otherwise Bad Words

updated at April 26, 2024, 12:22 a.m.

Unknown languages

71 +0

2,765 +9

654 -1

GitHub
tennis_atp by JeffSackmann

ATP Tennis Rankings, Results, and Stats

updated at April 25, 2024, 3:11 a.m.

Unknown languages

94 +0

925 -1

591 +0

GitHub
caption-contest-data by nextml

Data from the caption contest.

updated at April 24, 2024, 5:24 p.m.

HTML

7 +0

5 +0

2 +0

GitHub
country-list by umpirsky

globe with meridians List of all countries with names and ISO 3166-1 codes in all languages and data formats.

updated at April 23, 2024, 10:16 a.m.

HTML

155 +0

5,119 -1

1,544 +0

GitHub
countries by mledoze

World countries in JSON, CSV, XML and Yaml. Any help is welcome!

updated at April 22, 2024, 3:51 a.m.

PHP

159 +0

5,887 +1

1,260 +0

GitHub
congresstweets by alexlitel

Datasets of the daily Twitter output of Congress.

updated at April 21, 2024, 10:20 p.m.

SCSS

7 +0

98 +1

38 +0

GitHub