You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The code in this repo shows that baseline reward is calculated by averaging reward of generated captions. However, the original version of scst as well as some other scst implementation (e.g., in VALOR) calculate the baseline reward with greedy-search-generated caption. Is there any reference or explanation about current implementation in this repo? Really appreciate it if I obtain any help.
The text was updated successfully, but these errors were encountered:
The code in this repo shows that baseline reward is calculated by averaging reward of generated captions. However, the original version of scst as well as some other scst implementation (e.g., in VALOR) calculate the baseline reward with greedy-search-generated caption. Is there any reference or explanation about current implementation in this repo? Really appreciate it if I obtain any help.
The text was updated successfully, but these errors were encountered: