CONVOLUTIONAL VIDEO CAPTIONING

Video Captioning using CNN decoder

Abstract:

Temporal action localization is an important yet challenging problem. Given a long, untrimmed video containing multiple action instances and complex background content, we need not only to recognize the action categories but also to localize the start time and end time of each instance. Many state-of-the-art action localization models use 3D ConvNets (the C3D network), which are efficient at abstracting action semantics and achieve accurate recognition, but they cannot localize actions precisely because the temporal length of the input is reduced by a factor of 8. To localize action boundaries precisely, this paper designs a novel Convolutional-De-Convolutional (CDC) network, which places CDC layers on top of 3D ConvNets. These CDC layers perform temporal upsampling and spatial downsampling simultaneously to predict actions at frame-level granularity. Finally, the CDC network demonstrates high efficiency in jointly modeling action semantics in space-time and fine-grained temporal dynamics.
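
The temporal bookkeeping described above can be illustrated with a small shape trace. This is only a sketch under stated assumptions, not the repository's code: the function name `trace_cdc_shapes`, the 4x4 spatial size of the 3D ConvNet output, the use of three CDC layers that each double the temporal length, and the choice to collapse the spatial map to 1x1 in the first CDC layer are all hypothetical values chosen so that the factor-of-8 reduction mentioned in the abstract is undone.

```python
def trace_cdc_shapes(num_frames, spatial=4, temporal_factor=8, num_cdc_layers=3):
    """Trace (name, time, height, width) through a 3D ConvNet and CDC layers.

    Assumptions (not taken from the repository): the 3D ConvNet divides the
    temporal length by `temporal_factor`, each CDC layer doubles the temporal
    length, and the spatial map is reduced to 1x1 by the first CDC layer.
    """
    if num_frames % temporal_factor != 0:
        raise ValueError("num_frames must be a multiple of temporal_factor")

    # The 3D ConvNet shrinks the temporal axis, e.g. 32 frames -> 4 steps.
    time = num_frames // temporal_factor
    shapes = [("convnet_output", time, spatial, spatial)]

    for i in range(num_cdc_layers):
        time *= 2  # temporal upsampling by 2 in every CDC layer
        # Spatial downsampling: the map is 1x1 from the first CDC layer on.
        shapes.append((f"cdc_{i + 1}", time, 1, 1))

    return shapes


if __name__ == "__main__":
    for name, t, h, w in trace_cdc_shapes(32):
        print(f"{name:15s} time={t:3d} spatial={h}x{w}")
```

With three doubling layers, the temporal length after the last CDC layer equals the number of input frames, which is what allows one prediction per frame.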

Architecture

Results

Note: This repository is no longer maintained.


sreedattaSanjay/Convolutional-Video-Captioning
