domains by tb0hdan

World’s single largest Internet domains dataset

updated at May 26, 2024, 6:30 p.m.

HTML

30 +0

649 +3

102 +0

GitHub
tennis_atp by JeffSackmann

ATP Tennis Rankings, Results, and Stats

updated at May 26, 2024, 10:40 a.m.

Unknown languages

96 +0

943 +4

593 +1

GitHub
fma by mdeff

FMA: A Dataset For Music Analysis

updated at May 26, 2024, 10:27 a.m.

Jupyter Notebook

57 +0

2,149 +7

426 +0

GitHub
SaudiNewsNet by ParallelMazen

This repo contains a set of Arabic newspaper articles alongwith metadata, extracted from various Saudi newspapers.

updated at May 26, 2024, 9:55 a.m.

Unknown languages

7 +0

65 -1

15 +0

GitHub
countries by mledoze

World countries in JSON, CSV, XML and Yaml. Any help is welcome!

updated at May 26, 2024, 9:51 a.m.

PHP

159 +0

5,903 +3

1,260 +0

GitHub
coin_registry by Blockmodo

A global registry of JSON formatted files on 1500+ cryptocurrency tokens. Provides information like chat rooms, communities, explorers, and contact information on each coin. Used by https://blockmodo.com, DEXs, developers, and exchanges.

updated at May 26, 2024, 3:59 a.m.

Unknown languages

8 +0

113 +0

33 -1

GitHub
COVID-19 by CSSEGISandData

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE

updated at May 26, 2024, 3:29 a.m.

Unknown languages

865 +0

29,158 +0

18,486 -3

GitHub
transfermarkt-datasets by dcaribou

⚽️ Extract, prepare and publish Transfermarkt datasets.

updated at May 25, 2024, 7:34 p.m.

Python

9 +0

187 +8

48 +0

GitHub
List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words by LDNOOBW

List of Dirty, Naughty, Obscene, and Otherwise Bad Words

updated at May 25, 2024, 7:08 p.m.

Unknown languages

71 +0

2,786 -1

655 +2

GitHub
pudl by catalyst-cooperative

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.

updated at May 25, 2024, 6:44 p.m.

Python

18 +0

449 +1

106 +2

GitHub
bruteforce-database by duyetdev

Bruteforce database

updated at May 25, 2024, 11:42 a.m.

Unknown languages

70 +0

1,391 +3

558 +0

GitHub
tennis_wta by JeffSackmann

WTA Tennis Rankings, Results, and Stats

updated at May 25, 2024, 11:13 a.m.

Unknown languages

30 +0

211 +2

142 +1

GitHub
covid-19-data by NYTimes

A repository of data on coronavirus cases and deaths in the U.S.

updated at May 24, 2024, 10:59 a.m.

Unknown languages

318 +0

6,994 +3

3,471 -1

GitHub
country-list by umpirsky

globe with meridians List of all countries with names and ISO 3166-1 codes in all languages and data formats.

updated at May 23, 2024, 4:10 p.m.

HTML

155 +0

5,123 +3

1,547 +1

GitHub
gun-violence-data by jamesqo

A comprehensive, accessible database that contains records of over 260k US gun violence incidents from January 2013 to March 2018.

updated at May 23, 2024, 3:48 p.m.

Python

0 +0

4 +1

2 +0

GitHub
awesome-citygml by OloOcki

The ultimate list of open data semantic 3D city models

updated at May 23, 2024, 7:32 a.m.

Unknown languages

7 +0

187 +1

23 -1

GitHub
collection by artsmia

Mia collection metadata

updated at May 22, 2024, 5:41 p.m.

Unknown languages

13 +0

72 +0

10 +0

GitHub
caption-contest-data by nextml

Data from the caption contest.

updated at May 22, 2024, 5:15 p.m.

HTML

7 +0

5 +0

2 +0

GitHub
medal by McGill-NLP

Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain

updated at May 22, 2024, 2:40 p.m.

Python

11 +0

214 +2

36 +0

GitHub
open-traffic-collection by graphhopper

Collection of open data resources for traffic information

updated at May 22, 2024, 11:07 a.m.

Unknown languages

26 +0

382 +2

49 +0

GitHub