Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

noise size equal to number of actions #33

Open
SarodYatawatta opened this issue Mar 1, 2021 · 2 comments
Open

noise size equal to number of actions #33

SarodYatawatta opened this issue Mar 1, 2021 · 2 comments

Comments

@SarodYatawatta
Copy link

mu_prime = mu + T.tensor(np.random.normal(scale=self.noise),

Instead of a scalar noise, should be a vector of number of actions size
mu_prime = mu + T.tensor(np.random.normal(scale=self.noise,size=(self.n_actions,)),

@philtabor
Copy link
Owner

We're allowed to add a scalar quantity to a vector. Is there a reason why each component of the mu tensor should have a different random number added to it?

@SarodYatawatta
Copy link
Author

True, but by making mu tensor perturb by different random numbers, you can increase the exploration (as opposed to using the same random number)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants