You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Instead of a scalar noise, should be a vector of number of actions size
mu_prime = mu + T.tensor(np.random.normal(scale=self.noise,size=(self.n_actions,)),
The text was updated successfully, but these errors were encountered:
We're allowed to add a scalar quantity to a vector. Is there a reason why each component of the mu tensor should have a different random number added to it?
Youtube-Code-Repository/ReinforcementLearning/PolicyGradient/TD3/td3_torch.py
Line 163 in 733e452
Instead of a scalar noise, should be a vector of number of actions size
mu_prime = mu + T.tensor(np.random.normal(scale=self.noise,size=(self.n_actions,)),
The text was updated successfully, but these errors were encountered: