This repository contains Jupyter notebooks on how to download and handle well-known datasets in Python. 📚 🐍

the-learning-machine/ML-datasets

Info

This repository explains how to load and use datasets in PyTorch, TensorFlow, and scikit-learn. The datasets are grouped by modality, as listed under "List of Datasets".

How To Use

Each dataset has its own Jupyter notebook with loading instructions, the number of training and test samples, and plotted examples.
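
As a rough illustration of what such a notebook covers, here is a minimal sketch for the Iris dataset using scikit-learn. It is not taken from the repository's notebooks. Iris has no official train/test split, so the 80/20 stratified split and the `random_state` value are assumptions made for this example, and plotting is left out to keep it short.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

# Iris ships with scikit-learn, so nothing is downloaded here.
iris = load_iris()
X, y = iris.data, iris.target  # 150 samples, 4 features, 3 classes

# Assumption for illustration: an 80/20 split, stratified by class label.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

print(f"train samples: {len(X_train)}, test samples: {len(X_test)}")
print("classes:", ", ".join(iris.target_names))
```

Datasets such as MNIST or CIFAR-10 come with a predefined train/test split, so their loaders would not need the manual split step shown here.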

Author

The author is responsible for the content and quality of the code. Please direct any remarks to The Learning Machine (thelearningmachine.ai).

List of Datasets

Image

  • MNIST
  • FashionMNIST
  • CIFAR-10 and CIFAR-100

Natural Language Processing

  • GLUE Benchmark: 11 NLP tasks
  • WMT: Machine Translation

Others

  • Iris dataset
