Skip to content

Latest commit

 

History

History
69 lines (55 loc) · 4.58 KB

datasets.md

File metadata and controls

69 lines (55 loc) · 4.58 KB

Introduction

This is a growing collection of dataset I come across and I found interesting for various task related to Deep Learning.

Classification

Papers

Sound

One possible way to go is transfom those sounds into images using librosa library then train a CNN on top of this.

Image

It can be anything: planes, machineray, animals, cities, planets/stars, pokemons, Marvel characters, video games, fruits, road signs.

Satelite
  • DIUx xView 2018 Detection Challenge OBJECTS IN CONTEXT IN OVERHEAD IMAGERY - link
Emotion
X-Ray
  • Chest X-Ray Images (Pneumonia) - link paper
Nature
Others

Whales

  • Using deep learning to listen for whales - link
  • Whale FM: recordings of Orca and Pilot Whale - link
  • The Marinexplore and Cornell University Whale Detection Challenge - link #sound
  • Right Whale Recognition - link #image
  • Watkins Marine Mammal Sound Database - link #sound
  • NOAA Northest Fisheries Science Center - link

OCR

  • Arabic scientific manuscripts - link
  • Centre for Pattern Recognition and Machine Intelligence - link
  • Arabic Natural Language Processing at Stanford - link

Divers

  • Figure Eight: Data for Everyone - link
  • CommaAI - link

NLP

  • Datasets for Natural Language Processing - link
  • NLP datasets - link

Recommendation

  • a collection of datasets - link