(Update 2021.12.13) Source code is open now.
(Update 2021.01.11) More posts are available here.
The (introductory) notes cover Bandit Algorithms, MDPs, Model-free Methods, Value Function Approximation, and Policy Optimization. For state-of-the-art advances, one can refer directly to the papers and to some excellent blogs.
Hope you enjoy learning.