Info

This is a repository that contains information on loading and using datasets in PyTorch, Tensorflow and Sklearn. The datasets are divided by modality, as shown below in "List of Datasets".

How To Use

Each dataset has its own Jupyter Notebook with loading instructions, number of train and test samples, and plotted examples.

Author

The author is responsible for the content and quality of the code. Please refer to The Learning Machine (thelearningmachine.ai) for any remarks.

List of Datasets

Image

MNIST
FashionMNIST
CIFAR-10 and 100

Natural Language Processing

GLUE Benchmark: 11 NLP tasks
WMT: Machine Translation

Others

Iris dataset

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
image datasets		image datasets
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Info

How To Use

Author

List of Datasets

Image

Natural Language Processing

Others

About

Releases

Packages

Languages

the-learning-machine/ML-datasets

Folders and files

Latest commit

History

Repository files navigation

Info

How To Use

Author

List of Datasets

Image

Natural Language Processing

Others

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages