-
Notifications
You must be signed in to change notification settings - Fork 354
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
It seems dqn can't learn much #2
Comments
Yeah, guys. I'm trying to enhance the score using the A3C algorithm. If you have any improvement, please let me know! :) |
I'm applying the A3C algorithm on it. This is the baseline agent of the paper. |
Awesome! Will try it soon |
@chris-chris How's going with A3C? I see you changed the principle, is it getting any better? |
@Seraphli @ShadowDancer @vors @yilei I applied A2C algorithm. I think it works better.
|
I tried to run the code, and at some point the program threw out this error.
|
I'm having the same |
@davidkuhta @Seraphli I'll fix it! thanks!! |
@davidkuhta @Seraphli I fixed it. Can you guys check it out? |
Thanks @chris-chris! Running it now, will follow-up |
Ok, still ran into the same issue, I did see the initialization in the last commit: Here's how it ended:
|
Just in case anyone would like to look at a alternative work-in-progress implementation without openai-baselines dependency and complete action space: https://github.com/simonmeister/pysc2-rl-agents. |
Is there anyone who has encountered this error? TypeError: Can't instantiate abstract class SubprocVecEnv with abstract methods step_async, step_wait |
@mushroom1116 I change the pysc2-examples/common/vec_env/subproc_vec_env.py
to
and can run. |
I ran the script last night. It started with ~11 mean reward, and ended with ~15.5 mean reward.
I tried to play this mini-game myself, and I could get ~100 score or more.
Deepmind reached ~100 score in their video.
The text was updated successfully, but these errors were encountered: