You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
However, DeepQLearning.jl is always picking the full action space.
That's because the solve enumerates the actions once here, hands them into the policy, which are broadly used there after.
Do you think of a way to have action masking with the current implementation ?
The text was updated successfully, but these errors were encountered:
POMDPs.jl supports state-dependent action spaces
However, DeepQLearning.jl is always picking the full action space.
That's because the
solve
enumerates the actions once here, hands them into the policy, which are broadly used there after.Do you think of a way to have action masking with the current implementation ?
The text was updated successfully, but these errors were encountered: