Skip to content

TheAllen1996/Reinforcement_Learning_Notes

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reinforcement Learning Notes

(Update 2021.12.13) Source code is open now.

(Update 2021.01.11) More posts are available here.

The (introductory) notes include Bandit Algorithms, MDP, Model-free Methods, Value Function Approximation, Policy Optimization. For the state-of-the-art advances, one can refer to paper directly and some excellent blogs.

Hope you enjoy your learning.

About

A naive version.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages