pudl by catalyst-cooperative

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.

updated at May 19, 2024, 3:36 p.m.

Python

18 +0

448 +1

104 +0

GitHub
38-Cloud-A-Cloud-Segmentation-Dataset by SorourMo

This data set includes Landsat 8 images and their manually extracted pixel-level ground truths for cloud detection.

updated at May 19, 2024, 9:21 a.m.

MATLAB

6 +0

139 +1

37 +0

GitHub
COVID-19 by CSSEGISandData

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE

updated at May 19, 2024, 6:37 a.m.

Unknown languages

865 +0

29,158 -3

18,489 -5

GitHub
List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words by LDNOOBW

List of Dirty, Naughty, Obscene, and Otherwise Bad Words

updated at May 18, 2024, 9:53 p.m.

Unknown languages

71 +0

2,787 +11

653 -2

GitHub
domains by tb0hdan

World’s single largest Internet domains dataset

updated at May 18, 2024, 9:20 p.m.

HTML

30 +0

646 +3

102 -1

GitHub
covid-19-data by NYTimes

A repository of data on coronavirus cases and deaths in the U.S.

updated at May 18, 2024, 2:12 p.m.

Unknown languages

318 +0

6,991 +1

3,472 -1

GitHub
covid-19-data by yahoo

COVID-19 datasets are constructed entirely from primary (government and public agency) sources

updated at May 18, 2024, 9:55 a.m.

Unknown languages

31 +0

110 +1

25 +0

GitHub
bruteforce-database by duyetdev

Bruteforce database

updated at May 18, 2024, 4:32 a.m.

Unknown languages

70 +0

1,388 +4

558 -1

GitHub
fma by mdeff

FMA: A Dataset For Music Analysis

updated at May 18, 2024, 12:10 a.m.

Jupyter Notebook

57 +0

2,142 +4

426 +0

GitHub
tennis_atp by JeffSackmann

ATP Tennis Rankings, Results, and Stats

updated at May 17, 2024, 4:15 p.m.

Unknown languages

96 +0

939 +6

592 +0

GitHub
collection by artsmia

Mia collection metadata

updated at May 17, 2024, 3:55 p.m.

Unknown languages

13 +0

72 +0

10 +0

GitHub
awesome-citygml by OloOcki

The ultimate list of open data semantic 3D city models

updated at May 17, 2024, 7:30 a.m.

Unknown languages

7 +0

186 +1

24 +0

GitHub
transfermarkt-datasets by dcaribou

⚽️ Extract, prepare and publish Transfermarkt datasets.

updated at May 17, 2024, 5:12 a.m.

Python

9 +0

179 +3

48 +2

GitHub
open-data by freeCodeCamp

None

updated at May 16, 2024, 11:03 p.m.

HTML

16 +0

156 +2

41 +0

GitHub
medal by McGill-NLP

Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain

updated at May 16, 2024, 7:12 p.m.

Python

11 +0

212 +3

36 +0

GitHub
country-list by umpirsky

globe with meridians List of all countries with names and ISO 3166-1 codes in all languages and data formats.

updated at May 16, 2024, 1:12 p.m.

HTML

155 +0

5,120 +1

1,546 +1

GitHub
countries by mledoze

World countries in JSON, CSV, XML and Yaml. Any help is welcome!

updated at May 16, 2024, 10:51 a.m.

PHP

159 +0

5,900 +5

1,260 -1

GitHub
geo-maps by simonepri

🗺 High Quality GeoJSON maps programmatically generated.

updated at May 16, 2024, 10:50 a.m.

JavaScript

25 +0

1,237 +2

65 +0

GitHub
usa-soccer by gavinr

USA soccer teams - location and metadata

updated at May 16, 2024, 10:19 a.m.

JavaScript

5 +0

15 +1

12 +0

GitHub
caption-contest-data by nextml

Data from the caption contest.

updated at May 15, 2024, 5:17 p.m.

HTML

7 +0

5 +0

2 +0

GitHub