The purpose of this repository is to improve the paper
To install PyTorch, see installation instructions on the PyTorch website.
To install TorchText:
pip install torchtext
We'll also use spaCy to tokenize our data. To install spaCy, follow the instructions here, making sure to install the English models with:
python -m spacy download en
You can get the emoji dataset, collected from Twitter, at this link. This dataset is provided by SemEval 2018 Task 2. To run these notebooks, you must place the extracted Semeval2018-Task2-EmojiPrediction folder in the current directory.
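If you want to confirm the folder is in place before going further, a minimal sketch like the one below may help. It is not part of this repository: the helper name dataset_is_in_place is hypothetical, and it only checks that a directory with the expected name exists, not that its contents are complete.

```python
from pathlib import Path

# Folder name taken from the instructions above.
DATASET_DIR = "Semeval2018-Task2-EmojiPrediction"


def dataset_is_in_place(base="."):
    """Return True if the extracted dataset folder exists under `base`.

    Hypothetical helper for illustration; it does not inspect the
    folder's contents.
    """
    return (Path(base) / DATASET_DIR).is_dir()


if __name__ == "__main__":
    if dataset_is_in_place():
        print("Dataset folder found.")
    else:
        print(f"Missing folder: {DATASET_DIR} (expected in {Path.cwd()})")
```

Run it from the directory that holds the notebooks, since that is where the folder is expected.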
To prepare the dataset:
python prepare.py