Skip to content

craobhruadh/Reinforcement_Learning

Repository files navigation

Reinforcement_Learning

*Resources used

A Deep Reinforcement Learning Chatbot

Deep Reinforcement Learning for recommender systems:

Stanford CS 234: Reinforcement Learning

Berkeley CS 285:

Chapter 13 is on Policy Gradient Methods

David Silver's Deep Mind lectures are a good supplementary resource:

Textbooks:

Open AI:

https://spinningup.openai.com/en/latest/spinningup/rl_intro.html

This entire site is worth reading

Q learning:

Autonomous reinforcement learning from raw visual data, Lange & Riedmiller (2010) Q learning on top of latent space leared with autoencoder, uses fitted Q-iteration

"Human level control through deep reinforcement learning", Mnih et al (2013)

"Continous control with Deep Reinforcement Learning", Lillicrap et. al. (2015)

Classic papers

  • Watkins. (1989). Learning from delayed rewards: introduces Q-learning
  • Riedmiller. (2005). Neural fitted Q-iteration: batch-mode Q-learning with neural networks

Deep reinforcement learning Q-learning papers

Robots!

  • "Robotic manipulation with Deep Reinforcement Learning ant...", Gu, Holly, et. al. (2017)

  • "QT Opt: scalable Deep Reinforcement Learning of Vision-based Robotic Manipulation Skills". Kalashnikov, Irpan, Pastor

Recurrent models of visual attention:

Monte Carlo Tree Search:

  • Browne, Powley, Whitehouse, Lucas, Cowling, Rohlfshagen, Tavener, Perez, Samothrakis, Colton. (2012). A Survey of Monte Carlo Tree Search Methods

Fun exercises

Blackjack! Also talked about in the David Silver lectures and chapter 5 of Sutton and Barto https://www.davidsilver.uk/wp-content/uploads/2020/03/Easy21-Johannes.pdf

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published