Multimodal Author Profiling @ PAN 2018

Detailed description of this task can be found @PAN 2018. This code only analyzes tweets in English.

Dataset:The dataset used for this experiments can be downloaded from the PAN 2018.

Dependencies:

gensim
sklearn
nltk

Other requirements:

The GloVe models (100d & 200d) are required for word embeddings.

For image captioning, image caption generation using chainer was used. Need to extract image captions before using the above tool and store it in a csv file (format:imageid \t text).

Running the code

python master.py training_input_add test_input_add test_output_add

Output will be a xml file:

Reference

Please cite the following paper if you find this code is useful.

B. G. Patra, G. Das, and D. Das. 2018. Multimodal Author Profiling for Twitter - Notebook for PAN at CLEF 2018. In Proceedings of the PAN 2018 at CLEF-2018, Avignon, France. link

If you have any query please e-mail us. We welcome bug fixes and new features.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
cleaning		cleaning
ml		ml
processing		processing
processing_images		processing_images
resources		resources
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
master.py		master.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multimodal Author Profiling @ PAN 2018

Running the code

Reference

About

Releases

Packages

Languages

License

pan-webis-de/gopalpatra18

Folders and files

Latest commit

History

Repository files navigation

Multimodal Author Profiling @ PAN 2018

Running the code

Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages