This repository ports the Python 2 code of w2vv (official code: https://github.com/danieljf24/w2vv) to Python 3 and tests it on Windows 10. The main changes are:
- adjust the `print` function
- solve the encoding problem of reading word2vec `id.txt` and building the dict in `text2vec.py`
- `import cPickle as pkl` to `import _pickle as pkl`
- `open(".pkl", "r")` to `open(".pkl", "rb")`
- all `map(x,xxx)` calls to `list(map(x,xxx))`
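- `rank = np.where(inds/n_caption == i)[0][0]` in the `i2t` function in `evaluation.py` to `rank = np.where(inds//n_caption == i)[0][0]`, since all 5 captions of the same image are taken into evaluation (in Python 3, `/` is true division, so `//` is needed to map a caption index to its image index)
- etc.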
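
The mechanical changes listed above can be sketched in a few lines. This is a minimal illustration, not code from the repository: the file name `demo.pkl` and the sample data are made up, plain `pickle` is used where the port imports `_pickle`, and the rank lookup is a plain-Python stand-in for the `np.where` line.

```python
import os
import pickle as pkl  # Python 3 has no cPickle; the port uses `_pickle`, plain `pickle` also works
import tempfile

# Pickle files must be opened in binary mode ("wb"/"rb") under Python 3.
path = os.path.join(tempfile.mkdtemp(), "demo.pkl")  # hypothetical file name
with open(path, "wb") as f:
    pkl.dump({"word": 0}, f)
with open(path, "rb") as f:
    vocab = pkl.load(f)

# print is a function in Python 3, so the parentheses are required.
print("vocab size:", len(vocab))

# map() returns a lazy iterator in Python 3; wrap it in list() when a list is needed.
lengths = list(map(len, ["a", "bb", "ccc"]))

# "/" is true division in Python 3, so caption index -> image index needs "//".
n_caption = 5
inds = [7, 2, 11]  # hypothetical caption indices, sorted by similarity
image_of = [idx // n_caption for idx in inds]

# Plain-Python stand-in for np.where(inds // n_caption == i)[0][0]:
# position of the first retrieved caption that belongs to image i.
i = 0
rank = next(pos for pos, idx in enumerate(inds) if idx // n_caption == i)
```

With `/` instead of `//`, `7 / 5` evaluates to `1.4`, which never equals an integer image index, so the lookup would only match captions whose index is an exact multiple of `n_caption`.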

Test environment:

- Windows 10
- Python 3.5
- TensorFlow 1.8.0, CUDA 9.0
- Keras 2.1.6 with TensorFlow backend
- pydot 1.2.3 + Graphviz
- tensorboard_logger for tensorboard visualization

My test: Image to text: recall@1 34.2, recall@5 64.9, recall@10 77.3, 3.0, 13.6

Paper report: Image to text: recall@1 36.3, recall@5 66.4, recall@10 78.2
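
For context on the recall@K figures, a common way to compute them from per-query ranks is sketched below. It assumes the usual retrieval convention (a query counts as a hit if its first correct match has a 0-based rank below K) and is not taken from `evaluation.py`; the function name `recall_at_k` and the sample ranks are hypothetical.

```python
def recall_at_k(ranks, k):
    """Percentage of queries whose first correct match has 0-based rank < k."""
    hits = sum(1 for r in ranks if r < k)
    return 100.0 * hits / len(ranks)


# Hypothetical 0-based ranks of the first correct caption for five query images.
ranks = [0, 3, 12, 1, 7]

r1 = recall_at_k(ranks, 1)
r5 = recall_at_k(ranks, 5)
r10 = recall_at_k(ranks, 10)
```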