./notebooks
- practical examples (to run them installrequirements.txt
)./data
- here you can find the data that will be used throughout the workshop./src
- helper functions (mostly data gathering)./docker
- Dockerfiles for Morfeusz and MACA / krnnt (for working with Polish language)
We store large files with Git LFS. To download all data used in this tutorial you need to have Git LFS installed. Then, after cloning the repo, run the following.
git lfs pull
- Intro, around the NLP
- Text Normalization and RegEx
- LDA
- Lemma, PoS, dependency-parsing
- Lemmatization in Polish (morfeusz)
- BoW models
- word2vec
- Introduction to neural networks
- RNNs
- LSTMs