This is a repository where I attempt to reproduce the results of Asynchronous Methods for Deep Reinforcement Learning. It is still a work in progress, and the results are not yet on par with those reported in the paper.
Any feedback is welcome :)
I trained A3C on ALE's Breakout with 8 processes for about 2 days and 5 hours. The scores of test runs during training are plotted below, with one test run for every 100,000 training steps (counted by the global shared counter).
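For reference, here is a minimal sketch of how a test run can be scheduled against a global shared counter (the names `global_t`, `eval_interval`, and `run_test_episode` are my own placeholders, not the script's actual variables):

```python
import multiprocessing as mp

eval_interval = 100000          # one test run per this many global training steps
global_t = mp.Value('l', 0)     # counter shared and incremented by all workers

def maybe_run_test(last_eval_t):
    """Run a test episode each time the shared counter crosses the interval."""
    with global_t.get_lock():
        t = global_t.value
    if t - last_eval_t >= eval_interval:
        # run_test_episode()  # placeholder for an actual evaluation episode
        return t
    return last_eval_t
```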
You can make the trained model play Breakout with the following command:
python demo_a3c_ale.py <path-to-breakout-rom> trained_model/breakout_48100000.h5
- RMSprop
- learning rate: initialized to 3.5e-4 (policy) and 7e-4 (value function), linearly annealed to zero
- epsilon: 0.1 (added inside the square root; see the sketch after this list)
- alpha: 0.99
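For clarity, here is a minimal NumPy sketch of one RMSprop step with epsilon inside the square root, matching the hyperparameters above (illustrative only, not the actual Chainer optimizer code):

```python
import numpy as np

def rmsprop_update(param, grad, ms, lr, alpha=0.99, eps=0.1):
    """One RMSprop step; `ms` is the running mean of squared gradients."""
    ms *= alpha
    ms += (1.0 - alpha) * grad ** 2
    # epsilon is added inside the square root, as noted in the list above
    param -= lr * grad / np.sqrt(ms + eps)
```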
- Python 3.5.1
- chainer 1.8.1
- cached-property 1.3.0
- h5py 2.5.0
- Arcade-Learning-Environment
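If you use pip, the pinned Python dependencies above can be installed as follows (this exact command is my assumption; the Arcade-Learning-Environment itself has to be built separately from its own repository):

pip install chainer==1.8.1 cached-property==1.3.0 h5py==2.5.0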
python a3c_ale.py <number-of-processes> <path-to-atari-rom>
a3c_ale.py will save best-so-far models and test scores into the output directory.
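For example, to train on Breakout with 8 processes (the rom path here is only illustrative):

python a3c_ale.py 8 /path/to/breakout.bin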
python demo_a3c_ale.py <path-to-atari-rom> <trained-model>