You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, I found that finetuning foundation-model-1x starts with a high loss (14.0) using find-cave contractor data.
As described in section E.3, when using hierarchical action space, loss for the secondary mouse movement should be masked during training.
However, I didn't find the implementation in behavior_cloning.py, which may be the reason why the fintuning loss is large.
Looking forward to your reply, thanks.
The text was updated successfully, but these errors were encountered:
phython96
changed the title
Missing loss mask with hierarchical action space in behavior cloning script
Missing loss mask with hierarchical action space in behavior cloning script?
Feb 28, 2023
Hmm can you pinpoint the phrase about "secondary mouse movement"? The action space should capture mouse movement regardless of the task/setup.
The high loss is also probably due to to different training setup; due to memory constraints, the BC example trains with very small sequences lengths (and one sample at a time). The behaviour of the players in the find-cave (or other BASALT task) is also significantly different from general Minecraft gameplay, so I am not that surprised the loss starts high.
Hi, I found that finetuning foundation-model-1x starts with a high loss (14.0) using find-cave contractor data.
As described in section E.3, when using hierarchical action space, loss for the secondary mouse movement should be masked during training.
However, I didn't find the implementation in behavior_cloning.py, which may be the reason why the fintuning loss is large.
Looking forward to your reply, thanks.
The text was updated successfully, but these errors were encountered: