Skip to content

Commit

Permalink
move to new location / add readme
Browse files Browse the repository at this point in the history
  • Loading branch information
zhangchuheng123 committed Sep 5, 2018
1 parent 9224a70 commit ea36cac
Show file tree
Hide file tree
Showing 5 changed files with 20 additions and 0 deletions.
20 changes: 20 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
# Reinforcement-Implementation

This project aims to reproduce the results of several model-free RL algorithms in continuous action domain (mujuco environment).

This projects
* uses pytorch package
* implements every algorithm independently in one file
* is written in simplest style
* tries to follow the original paper and reproduce their results

My first stage of work is to reproduce this figure in the PPO paper.

![](docs/ppo_experiments.png)

- [x] A2C
- [ ] ACER (A2C + Trust Region)
- [ ] CEM
- [x] TRPO (TRPO single path)
- [x] PPO (PPO clip)
- [ ] Vanilla PG, Adaptive
File renamed without changes.
File renamed without changes.
File renamed without changes.
Binary file added docs/ppo_experiments.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit ea36cac

Please sign in to comment.