Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Flat A3C agent #1

Open
dai-dao opened this issue Dec 24, 2017 · 2 comments
Open

Flat A3C agent #1

dai-dao opened this issue Dec 24, 2017 · 2 comments

Comments

@dai-dao
Copy link

dai-dao commented Dec 24, 2017

Hi,

I really like your work, and want to ask for some clarifications on your new observation on training a flat A3C agent without the meta-controller. In this case are the sub-goals randomly generated every 'c' timesteps? (instead of the meta-controller outputting the sub-goal)

Thanks,
Dai

@Nat-D
Copy link
Owner

Nat-D commented Dec 25, 2017

Hi Dai,

Yes, the sub-goals were randomly generated every c=100 time-steps. I also found that fixing the sub-goal to be just the first one also works in some seed. This only works with feature-control pseudo reward tho.

Best,
Nat

@nina124
Copy link

nina124 commented Dec 30, 2017

Does "randomly generated meta-action" work with pixel-control pseudo reward?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants