This repository collects my deep learning experiments. It is a Python project whose dependencies can be installed with Poetry (e.g. by running `poetry install`).
The `ml-playground` folder contains PyTorch implementations of various models, in which, for educational purposes, I tend to reimplement most of the layers I use from scratch (a small illustration of this style follows the list below).
So far, it includes the following models:
- Recurrent Neural Network (RNN)
- Long Short-Term Memory (LSTM)[^1]
- Transformer[^2]
- Bidirectional Encoder Representations from Transformers (BERT)[^3]
- Graph Convolutional Network (GCN)[^4]
More models will follow.
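To give a flavour of this from-scratch style, here is a minimal sketch of an Elman RNN cell written directly against PyTorch primitives. It is only an illustration of the approach; the class and names below are made up for this example and are not the repository's actual code.

```python
import torch
from torch import nn


class VanillaRNNCell(nn.Module):
    """One Elman RNN step: h_t = tanh(W_x x_t + W_h h_{t-1} + b)."""

    def __init__(self, input_size: int, hidden_size: int):
        super().__init__()
        self.input_to_hidden = nn.Linear(input_size, hidden_size)
        self.hidden_to_hidden = nn.Linear(hidden_size, hidden_size)

    def forward(self, x: torch.Tensor, h: torch.Tensor) -> torch.Tensor:
        # x: (batch, input_size), h: (batch, hidden_size)
        return torch.tanh(self.input_to_hidden(x) + self.hidden_to_hidden(h))


# Unroll the cell over a toy sequence of shape (batch, seq_len, input_size).
cell = VanillaRNNCell(input_size=8, hidden_size=16)
sequence = torch.randn(4, 10, 8)
hidden = torch.zeros(4, 16)
for t in range(sequence.size(1)):
    hidden = cell(sequence[:, t], hidden)
```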
The `examples` folder contains two NLP applications based on the Transformer model. Each of them can be run with `python -m examples.<example name>`, which displays the available features. Both come with pretrained weights and an integration with the Language Interpretability Tool[^6], which makes it possible to visualise attention weights, among other things.
The first is a language model based on a decoder-only Transformer. It was trained on the TinyStories[^5] corpus, which consists of short stories written in basic English.
For example, when prompted with "Once upon a time", it generates the following text (a sketch of the sampling loop behind this kind of demo follows the sample):
> Once upon a time, there was a little girl named Lily. She loved to play outside in the sunshine. One day, she went to the park to play on the swings. She saw a boy sitting on a bench.
>
> "Hi, little girl! What are you doing here?" replied the boy.
>
> "I'm looking for something to do," replied Lily.
>
> The boy smiled and said, "That's a great idea, little girl. Would you like to play with me?"
>
> Lily was so happy and said, "Yes, please!"
>
> The boy gave her a big hug and said, "You're welcome, little girl. I'm glad I could join you."
>
> Lily and the boy spent the day playing together in the sunshine.
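Under the hood, this kind of prompt-to-story demo boils down to sampling one token at a time from the model. The sketch below shows a generic sampling loop for a decoder-only language model; `model` and `tokenizer` are placeholders used for illustration, not the repository's actual API.

```python
import torch


@torch.no_grad()
def generate(model, tokenizer, prompt: str, max_new_tokens: int = 100,
             temperature: float = 1.0) -> str:
    """Sample text autoregressively from a decoder-only language model."""
    # Assumptions: `tokenizer` exposes encode/decode, and `model` maps token
    # ids of shape (batch, seq_len) to logits of shape (batch, seq_len, vocab).
    tokens = torch.tensor([tokenizer.encode(prompt)])
    for _ in range(max_new_tokens):
        logits = model(tokens)[:, -1, :] / temperature  # next-token logits
        probabilities = torch.softmax(logits, dim=-1)
        next_token = torch.multinomial(probabilities, num_samples=1)
        tokens = torch.cat([tokens, next_token], dim=1)
    return tokenizer.decode(tokens[0].tolist())


# e.g. generate(model, tokenizer, "Once upon a time")
```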
The second is a simple translation model powered by a Transformer, which translates English sentences into French. It was trained on the English-to-French corpus of the Tatoeba project, released under CC-BY 2.0 FR.
For example, the sentence

> She saw a boy sitting on a bench.

is correctly translated to

> Elle vit un garçon assis sur un banc.
Note that this model remains quite unreliable due to the small size of its training corpus; a generic decoding loop for this kind of model is sketched below.
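For reference, greedy decoding with an encoder-decoder translation Transformer usually follows the pattern below. The `model`, the tokenizers, and the special token ids are assumptions made for the sake of the sketch, not this repository's actual interface.

```python
import torch


@torch.no_grad()
def translate(model, src_tokenizer, tgt_tokenizer, sentence: str,
              bos_id: int = 1, eos_id: int = 2, max_len: int = 64) -> str:
    """Greedily decode a translation, feeding the target prefix back in."""
    # Assumption: `model(source, target)` returns logits of shape
    # (batch, tgt_len, vocab) for the next target token at each position.
    source = torch.tensor([src_tokenizer.encode(sentence)])
    target = torch.tensor([[bos_id]])
    for _ in range(max_len):
        logits = model(source, target)
        next_token = logits[:, -1, :].argmax(dim=-1, keepdim=True)
        if next_token.item() == eos_id:
            break
        target = torch.cat([target, next_token], dim=1)
    return tgt_tokenizer.decode(target[0, 1:].tolist())


# e.g. translate(model, en_tokenizer, fr_tokenizer, "She saw a boy sitting on a bench.")
```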
Footnotes
[^1]: Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735-1780.

[^2]: Vaswani, A., Shazeer, N.M., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., & Polosukhin, I. (2017). Attention is All you Need. Neural Information Processing Systems.

[^3]: Devlin, J., Chang, M., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. North American Chapter of the Association for Computational Linguistics.

[^4]: Kipf, T., & Welling, M. (2016). Semi-Supervised Classification with Graph Convolutional Networks. ArXiv, abs/1609.02907.

[^5]: Eldan, R., & Li, Y. (2023). TinyStories: How Small Can Language Models Be and Still Speak Coherent English? ArXiv, abs/2305.07759.

[^6]: Tenney, I., Wexler, J., Bastings, J., Bolukbasi, T., Coenen, A., Gehrmann, S., Jiang, E., Pushkarna, M., Radebaugh, C., Reif, E., & Yuan, A. (2020). The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models. Conference on Empirical Methods in Natural Language Processing.