An unofficial TensorFlow 2.x implementation of the ICCV 2019 paper "Relation-Aware Graph Attention Network for Visual Question Answering".
This is a rewrite of the PyTorch 1.0.1 based implementation available here. Some parts are still work in progress (the explicit relation encoder, the semantic relation encoder, BAN, and MuTAN); the BUTD-based model can be trained.
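For orientation, the sketch below shows the kind of BUTD-style top-down attention fusion such a model is built around: attention weights over the bottom-up region features are computed from the question embedding, and the attended image feature is fused with the question feature before answer classification. This is a minimal illustration with assumed names (`ButdFusion`, `num_hid`, `num_answers`) and shapes, not the repository's actual code.

```python
# Minimal sketch of BUTD-style top-down attention fusion (illustrative only;
# class and argument names are assumptions, not this repository's API).
import tensorflow as tf

class ButdFusion(tf.keras.layers.Layer):
    def __init__(self, num_hid=1024, num_answers=3129):
        super().__init__()
        # Attention over the K bottom-up region features, conditioned on the question.
        self.att_fc = tf.keras.layers.Dense(num_hid, activation="relu")
        self.att_logits = tf.keras.layers.Dense(1)
        # Projections for the attended image feature and the question feature.
        self.v_proj = tf.keras.layers.Dense(num_hid, activation="relu")
        self.q_proj = tf.keras.layers.Dense(num_hid, activation="relu")
        self.classifier = tf.keras.layers.Dense(num_answers)

    def call(self, v, q):
        # v: [batch, K, v_dim] bottom-up region features
        # q: [batch, q_dim]    question embedding (e.g. last GRU state)
        q_tiled = tf.tile(q[:, tf.newaxis, :], [1, tf.shape(v)[1], 1])
        joint = tf.concat([v, q_tiled], axis=-1)
        att = tf.nn.softmax(self.att_logits(self.att_fc(joint)), axis=1)  # [batch, K, 1]
        v_att = tf.reduce_sum(att * v, axis=1)                            # [batch, v_dim]
        # Element-wise (Hadamard) fusion of projected image and question features.
        fused = self.v_proj(v_att) * self.q_proj(q)
        return self.classifier(fused)                                     # answer logits
```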
- TensorFlow 2.x
source download.sh
The total size of the data is about 90 GB, and the dataset is structured as follows.
├── data
│ ├── Answers
│ ├── Bottom-up-features-adaptive
│ ├── Bottom-up-features-fixed
│ ├── cache
│ ├── cp_v2_annotations
│ ├── cp_v2_questions
│ ├── glove
│ ├── imgids
│ ├── Questions
│ ├── visualGenome
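Before training, it can help to verify that the expected top-level directories are in place. The sketch below simply mirrors the tree above and assumes `data/` sits in the working directory; adjust `DATA_ROOT` if your layout differs.

```python
# Quick sanity check that the expected data layout is in place
# (directory names taken from the tree above; DATA_ROOT is an assumption).
import os

DATA_ROOT = "data"
EXPECTED = [
    "Answers", "Bottom-up-features-adaptive", "Bottom-up-features-fixed",
    "cache", "cp_v2_annotations", "cp_v2_questions", "glove",
    "imgids", "Questions", "visualGenome",
]

missing = [d for d in EXPECTED if not os.path.isdir(os.path.join(DATA_ROOT, d))]
if missing:
    print("Missing directories:", ", ".join(missing))
else:
    print("Data layout looks complete.")
```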
python main.py --config config/butd_vqa.json
I trained the model on an A100 40GB GPU (batch size: 256); the code uses about 39 GB of GPU memory.
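If you train on a smaller GPU, lowering the batch size in the config is the first thing to try. Enabling TensorFlow's on-demand memory growth (a standard TF 2.x option, sketched below) also keeps the process from reserving the whole card up front.

```python
# Optional: enable memory growth so TensorFlow allocates GPU memory on demand
# instead of reserving all of it at startup (standard TF 2.x API).
import tensorflow as tf

for gpu in tf.config.list_physical_devices("GPU"):
    tf.config.experimental.set_memory_growth(gpu, True)
```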
python main.py --config config/butd_vqa.json --mode eval --checkpoint <pretrained_model_path>
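Evaluation restores the trained weights before running inference. The sketch below shows how that typically looks with `tf.train.Checkpoint`; the placeholder model and checkpoint path are assumptions, not the repository's actual code (pass the real path via `--checkpoint`).

```python
# Sketch of restoring trained weights for evaluation with tf.train.Checkpoint.
# The placeholder model and path are illustrative, not this repository's API.
import tensorflow as tf

# In practice this would be the BUTD model constructed by main.py.
model = tf.keras.Sequential([tf.keras.layers.Dense(10)])

ckpt = tf.train.Checkpoint(model=model)
status = ckpt.restore("<pretrained_model_path>")  # value passed to --checkpoint
# expect_partial() ignores objects (e.g. optimizer slots) not needed at eval time.
status.expect_partial()
```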
| | Accuracy (BUTD fusion) |
|---|---|
| Official PyTorch Code | 63.99 |
| TensorFlow 2.0 Code | 63.24 |
You can check the training results in train.ipynb.