Using price and news feature based on Bi-RNN.
- Python 2.7
- Numpy 1.12
- Tensorflow r1.0
- Tqdm
- Pyltp
- Word2Vec
- GloVe
- FastText
In run.sh, you can set the start_step and end_step to set which part you want to run. And set other parameters to correct data paths.
- step 0, pre-process datas.
- step 1, calculate pmi related info. (Not used by this model)
- step 2, train GloVe word embedding.
- step 3, train Word2Vec word embedding.
- step 4, train fastText word embedding.
- step 5, train this model. All the parameters can be set in
model.py
. UseCtrl+C
to stop training when you think it's converged. - step 6, test this model.
- data_utils.py, for data processing
- pmi.py, for PMI related calculation
- model.py, for model training and testing
- train_fastText.sh, train fastText doc embedding
- train_word2vec.sh, train Word2Vec word embedding
- train_glove.sh, train GloVe word embedding
- run.sh, script for whole process