38-Cloud-A-Cloud-Segmentation-Dataset by SorourMo

This data set includes Landsat 8 images and their manually extracted pixel-level ground truths for cloud detection.

created at Feb. 6, 2019, 12:11 a.m.

MATLAB

6 +0

138 +0

37 +0

GitHub
3w_dataset by ricardovvargas

The first realistic and public dataset with rare undesirable real events in oil wells.

created at Jan. 19, 2019, 12:30 a.m.

Jupyter Notebook

13 +0

104 +0

56 +0

GitHub
usa-soccer by gavinr

USA soccer teams - location and metadata

created at Nov. 27, 2018, 3:34 p.m.

JavaScript

5 +0

14 +0

12 +0

GitHub
coin_registry by Blockmodo

A global registry of JSON formatted files on 1500+ cryptocurrency tokens. Provides information like chat rooms, communities, explorers, and contact information on each coin. Used by https://blockmodo.com, DEXs, developers, and exchanges.

created at June 20, 2018, 6:15 a.m.

Unknown languages

8 +0

113 +0

34 +0

GitHub
gun-violence-data by jamesqo

A comprehensive, accessible database that contains records of over 260k US gun violence incidents from January 2013 to March 2018.

created at April 2, 2018, 6:54 p.m.

Python

0 +0

3 +0

2 +0

GitHub
tracebase by areinhardt

The tracebase appliance-level power consumption data set

created at Nov. 17, 2017, 12:09 p.m.

Unknown languages

2 +0

36 +0

15 +0

GitHub
geo-maps by simonepri

🗺 High Quality GeoJSON maps programmatically generated.

created at Sept. 2, 2017, 12:42 p.m.

JavaScript

25 +0

1,230 +0

64 +1

GitHub
congresstweets by alexlitel

Datasets of the daily Twitter output of Congress.

created at May 3, 2017, 10:47 p.m.

SCSS

7 +0

97 +0

38 -1

GitHub
pudl by catalyst-cooperative

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.

created at Feb. 1, 2017, 5:45 p.m.

Python

18 +0

441 +1

97 +0

GitHub
mcafp by google

None

created at Dec. 9, 2016, 8:10 a.m.

Python

2 +0

39 +0

17 +0

GitHub
fma by mdeff

FMA: A Dataset For Music Analysis

created at Dec. 2, 2016, 3:27 p.m.

Jupyter Notebook

58 +0

2,120 +5

425 +1

GitHub
pinhooker by phillc73

An R Package to compile data sets of historic results from thoroughbred sales

created at Jan. 15, 2016, 7:30 p.m.

R

5 +0

2 +0

0 +0

GitHub
open-data by freeCodeCamp

None

created at Nov. 25, 2015, 10:15 p.m.

HTML

16 +0

152 +1

41 +0

GitHub
caption-contest-data by nextml

Data from the caption contest.

created at Nov. 23, 2015, 11:46 p.m.

HTML

7 +0

5 +0

2 +0

GitHub
bruteforce-database by duyetdev

Bruteforce database

created at Oct. 4, 2015, 2:55 p.m.

Unknown languages

70 +0

1,375 +7

558 +1

GitHub
uber-tlc-foil-response by fivethirtyeight

Uber trip data from a freedom of information request to NYC's Taxi & Limousine Commission

created at Aug. 28, 2015, 6:38 p.m.

Unknown languages

70 +0

706 +1

374 +0

GitHub
skytrax-reviews-dataset by quankiquanki

An air travel dataset consisting of user reviews from Skytrax (www.airlinequality.com)

created at Aug. 11, 2015, 11:28 a.m.

Python

1 +0

70 +0

39 +0

GitHub
SaudiNewsNet by ParallelMazen

This repo contains a set of Arabic newspaper articles alongwith metadata, extracted from various Saudi newspapers.

created at July 21, 2015, 8:40 p.m.

Unknown languages

7 +0

66 +0

16 +0

GitHub
open-traffic-collection by graphhopper

Collection of open data resources for traffic information

created at April 13, 2015, 11:39 a.m.

Unknown languages

26 +0

375 +0

49 +1

GitHub
tennis_wta by JeffSackmann

WTA Tennis Rankings, Results, and Stats

created at March 13, 2015, 12:21 p.m.

Unknown languages

29 +0

207 +0

140 +1

GitHub