A reference repository for implementations of some of the most used Reinforcement Learning algorithm for tutorial purposes.
- Q-Learning on MountainView-v0 Reference
Results (Average reward vs Num Episodes):
- SARSA
Results (Average reward vs Num Episodes):
To-Do:
- DQN
- DDPG
- A2C/A3C
- PPO
- TRPO