Question about baseline reward in `caption_mplug_scst.py` #9

czy-orange · 2023-12-21T05:53:33Z

The code in this repo computes the baseline reward by averaging the rewards of the generated captions. However, the original version of SCST, as well as some other SCST implementations (e.g., in VALOR), computes the baseline reward from the caption generated by greedy search. Is there a reference or explanation for the current implementation in this repo? I would really appreciate any help.
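To make the contrast concrete, the two baseline choices can be sketched as below. This is only an illustration under stated assumptions, not the code from `caption_mplug_scst.py`: the function names `advantages_mean_baseline` and `advantages_greedy_baseline` and all reward values are hypothetical, and it assumes several captions are generated per image and scored with a scalar reward such as CIDEr.

```python
from typing import List


def advantages_mean_baseline(sample_rewards: List[float]) -> List[float]:
    """Baseline = mean reward of the captions generated for one image."""
    baseline = sum(sample_rewards) / len(sample_rewards)
    return [r - baseline for r in sample_rewards]


def advantages_greedy_baseline(
    sample_rewards: List[float], greedy_reward: float
) -> List[float]:
    """Baseline = reward of the greedy-search caption for the same image."""
    return [r - greedy_reward for r in sample_rewards]


# Hypothetical rewards for 4 generated captions of a single image.
sample_rewards = [1.0, 0.5, 1.5, 1.0]
# Hypothetical reward of the greedy-search caption for that image.
greedy_reward = 1.25

adv_mean = advantages_mean_baseline(sample_rewards)
adv_greedy = advantages_greedy_baseline(sample_rewards, greedy_reward)

print(adv_mean)    # [0.0, -0.5, 0.5, 0.0]
print(adv_greedy)  # [-0.25, -0.75, 0.25, -0.25]
```

In both cases the advantage would weight the log-probability of each generated caption in the policy-gradient loss. With the mean baseline, the advantages for an image sum to zero and no extra greedy decoding pass is needed. The greedy baseline requires one additional decoding per image.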
