transfermarkt-datasets by dcaribou

⚽️ Extract, prepare and publish Transfermarkt datasets.

updated at April 27, 2024, 12:21 a.m.

Python

9 +0

174 +3

45 +0

GitHub
tennis_wta by JeffSackmann

WTA Tennis Rankings, Results, and Stats

updated at April 27, 2024, 1:48 p.m.

Unknown languages

29 +0

208 +1

140 +0

GitHub
pudl by catalyst-cooperative

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.

updated at April 27, 2024, 3:31 p.m.

Python

18 +0

443 +2

101 +4

GitHub
domains by tb0hdan

World’s single largest Internet domains dataset

updated at April 27, 2024, 4:38 p.m.

HTML

29 +0

639 +4

103 +0

GitHub
collection by artsmia

Mia collection metadata

updated at April 27, 2024, 7:33 p.m.

Unknown languages

13 +0

72 +0

10 +0

GitHub
COVID-19 by CSSEGISandData

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE

updated at April 27, 2024, 9:59 p.m.

Unknown languages

865 +0

29,173 -6

18,498 +1

GitHub
fma by mdeff

FMA: A Dataset For Music Analysis

updated at April 28, 2024, 5:20 a.m.

Jupyter Notebook

57 -1

2,125 +5

425 +0

GitHub
uber-tlc-foil-response by fivethirtyeight

Uber trip data from a freedom of information request to NYC's Taxi & Limousine Commission

updated at April 28, 2024, 5:36 a.m.

Unknown languages

70 +0

708 +2

374 +0

GitHub
bruteforce-database by duyetdev

Bruteforce database

updated at April 28, 2024, 11:44 a.m.

Unknown languages

70 +0

1,378 +3

558 +0

GitHub
medal by McGill-NLP

Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain

updated at April 28, 2024, 4:27 p.m.

Python

10 +0

208 +4

36 +0

GitHub
open-traffic-collection by graphhopper

Collection of open data resources for traffic information

updated at April 28, 2024, 6:12 p.m.

Unknown languages

26 +0

376 +1

49 +0

GitHub