transfermarkt-datasets by dcaribou

⚽️ Extract, prepare and publish Transfermarkt datasets.

updated at April 14, 2024, 1:29 p.m.

Python

9 +2

170 +4

45 +1

GitHub
lemon-dataset by softwaremill

Lemons quality control dataset

updated at April 14, 2024, 12:31 p.m.

Unknown languages

5 +0

98 +1

12 +0

GitHub
collection by tategallery

Tate Collection metadata

updated at April 13, 2024, 6:16 p.m.

Python

59 +0

505 -1

186 +0

GitHub
covid-19-data by NYTimes

A repository of data on coronavirus cases and deaths in the U.S.

updated at April 13, 2024, 3:27 p.m.

Unknown languages

318 +0

6,987 +1

3,474 -1

GitHub
uber-tlc-foil-response by fivethirtyeight

Uber trip data from a freedom of information request to NYC's Taxi & Limousine Commission

updated at April 13, 2024, 12:48 p.m.

Unknown languages

70 +0

705 +1

374 +0

GitHub
COVID-19 by CSSEGISandData

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE

updated at April 13, 2024, 7:22 a.m.

Unknown languages

865 +0

29,177 -7

18,501 -7

GitHub
country-list by umpirsky

globe with meridians List of all countries with names and ISO 3166-1 codes in all languages and data formats.

updated at April 13, 2024, 12:49 a.m.

HTML

155 +0

5,119 +2

1,546 +0

GitHub
tennis_atp by JeffSackmann

ATP Tennis Rankings, Results, and Stats

updated at April 12, 2024, 10:51 p.m.

Unknown languages

94 +0

924 +0

591 +0

GitHub
tennis_wta by JeffSackmann

WTA Tennis Rankings, Results, and Stats

updated at April 12, 2024, 10:51 p.m.

Unknown languages

29 +0

207 -1

139 +0

GitHub
congresstweets by alexlitel

Datasets of the daily Twitter output of Congress.

updated at April 12, 2024, 5:17 p.m.

SCSS

7 +1

97 +3

39 +1

GitHub
domains by tb0hdan

World’s single largest Internet domains dataset

updated at April 12, 2024, 3:20 p.m.

HTML

29 +0

632 +0

103 +0

GitHub
38-Cloud-A-Cloud-Segmentation-Dataset by SorourMo

This data set includes Landsat 8 images and their manually extracted pixel-level ground truths for cloud detection.

updated at April 12, 2024, 8:25 a.m.

MATLAB

6 +0

138 +1

37 +0

GitHub
bruteforce-database by duyetdev

Bruteforce database

updated at April 12, 2024, 2:01 a.m.

Unknown languages

70 +0

1,368 +5

557 +0

GitHub
All-Age-Faces-Dataset by JingchunCheng

All-Age-Faces (AAF) Database.

updated at April 11, 2024, 9:25 p.m.

Unknown languages

4 +0

172 +2

16 +0

GitHub
countries by mledoze

World countries in JSON, CSV, XML and Yaml. Any help is welcome!

updated at April 11, 2024, 4:30 p.m.

PHP

160 +0

5,884 +5

1,260 +1

GitHub
List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words by LDNOOBW

List of Dirty, Naughty, Obscene, and Otherwise Bad Words

updated at April 11, 2024, 2:27 p.m.

Unknown languages

71 +0

2,743 +0

652 +0

GitHub
pudl by catalyst-cooperative

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.

updated at April 11, 2024, 2:14 p.m.

Python

18 +0

440 +2

97 +4

GitHub
fma by mdeff

FMA: A Dataset For Music Analysis

updated at April 11, 2024, 1:59 p.m.

Jupyter Notebook

58 +0

2,115 +6

424 +0

GitHub
SaudiNewsNet by ParallelMazen

This repo contains a set of Arabic newspaper articles alongwith metadata, extracted from various Saudi newspapers.

updated at April 10, 2024, 1:19 a.m.

Unknown languages

7 +0

66 -1

16 +0

GitHub
geo-maps by simonepri

🗺 High Quality GeoJSON maps programmatically generated.

updated at April 9, 2024, 4:18 p.m.

JavaScript

25 +0

1,230 +4

63 +0

GitHub