noise size equal to number of actions #33

SarodYatawatta · 2021-03-01T11:56:56Z

Youtube-Code-Repository/ReinforcementLearning/PolicyGradient/TD3/td3_torch.py

Line 163 in 733e452

mu_prime = mu + T.tensor(np.random.normal(scale=self.noise),

Instead of a single scalar noise sample, this should be a vector with one entry per action:
mu_prime = mu + T.tensor(np.random.normal(scale=self.noise,size=(self.n_actions,)),

philtabor · 2021-08-03T16:20:26Z

We're allowed to add a scalar quantity to a vector. Is there a reason why each component of the mu tensor should have a different random number added to it?

SarodYatawatta · 2021-08-03T19:21:47Z

True, but perturbing each component of the mu tensor with a different random number can increase exploration, compared with adding the same random number to every component.
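
The difference between the two variants can be sketched in plain NumPy. This is only an illustration, not the repository's code: `n_actions`, `noise_scale`, and the zero-valued `mu` are placeholder values standing in for `self.n_actions`, `self.noise`, and the actor output, and the seeded generator is there only to make the run repeatable.

```python
import numpy as np

rng = np.random.default_rng(0)  # seeded only so the sketch is repeatable
n_actions = 3                   # hypothetical action dimension
noise_scale = 0.1               # stands in for self.noise
mu = np.zeros(n_actions)        # stands in for the actor's output

# Scalar noise: a single draw, broadcast to every action component.
scalar_noise = rng.normal(scale=noise_scale)
mu_prime_shared = mu + scalar_noise

# Vector noise: one independent draw per action component.
vector_noise = rng.normal(scale=noise_scale, size=(n_actions,))
mu_prime_independent = mu + vector_noise

print("shared offset:     ", mu_prime_shared - mu)
print("independent offset:", mu_prime_independent - mu)
```

With the scalar draw, every component of the action is shifted by the same amount, so the perturbation always lies along the all-ones direction of the action space. With `size=(n_actions,)`, each component gets its own sample and the perturbation can point in any direction.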
