CNN for Text Classification

PyTorch implementation of the CNN detailed in the paper Convolutional Neural Netowrks for Sentence Classification.

Process Data

The data is available in the ./data folder. To process run:

python3 process_data.py <word2vec_bin_path> ./data

The pre-trained Google News word2vec binary can be downloaded here. Alternatively you can run make google-news-word2vec which will download the file to your directory. You will need wget installed.

Training

python3 main.py <pickle_file> <mode>

The <pickle_file> is path the path to the output from the process step above. Training can be run in various modes as specified in the paper. The three available modes are:

random: Randomnly initialised embedding matrix which is updated during training
static: Uses pre-trained embeddings which are fixed during training
non_static: Starts with the pre-trained embeddings and fine tunes on the dataset.

Training follow the paper and uses 10-fold cross validation. The number of folds can be changed to with the --cv_folds argument. Note however, this should match the value used in the preprocessing step

Finally, if training on a gpu, set the --use_gpu to true.

python3 main.py <pickle_file> <mode>  --use_gpu true

Other parameters

The CNN model parameters and other training parameters are specified in main.py.

Other Comments

Experiments were run using the Adadelta optimiser however, unfortunately, we were unable to match the performance described in the paper. The best results were approximately 70%. Adam with a learning rate of 0.0001 performed marginally better.

Requirements

python 3.7

pip3 install -r requirements.txt

References

Original paper Convolutional Neural Network for Sentence Classification and code
Denny Britz TensorFlow implementation & blog

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

CNN for Text Classification

Process Data

Training

Other parameters

Other Comments

Requirements

References

Files

README.md

Latest commit

History

README.md

File metadata and controls

CNN for Text Classification

Process Data

Training

Other parameters

Other Comments

Requirements

References