Reinforcement Learning Summer School 2017

To compile the tile-coding library:

cd tile-coding
make
export PYTHONPATH=$PYTHONPATH:pathto/tile-coding

300+ Notebooks from McGill COMP-767, Intro to RL

A collections of notebooks written by the students of McGill COMP-767, Intro to RL. We had a "bring your own assignment" model in which the students would create their own "assignment" related to the course material. The assignments would generally take the form of a Jupyter notebook exploring some questions empirically and/or theoretically.

With no particular order, a few awesome notebooks :

Wrapper to Marlos Machado's Linear Features for ALE

The instructions are provided in the README.md in :

shallowpy

The example code relies on memory overcommitment which is rather useful to know about. The overcommit mode can be read/set via cat /proc/sys/vm/overcommit_memory. From the Kernel documentation :

1 - Always overcommit. Appropriate for some scientific applications. Classic example is code using sparse arrays and just relying on the virtual memory consisting almost entirely of zero pages.

Other References

OpenAI Baselines
The original Mountain Car code written by Richard Sutton Mountain Car Software
The tile coding library used by the RLAI Tile Coding Software

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
tile-coding		tile-coding
README.md		README.md
Sarsa Function Approximation.ipynb		Sarsa Function Approximation.ipynb
Tabular Control.ipynb		Tabular Control.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning Summer School 2017

300+ Notebooks from McGill COMP-767, Intro to RL

Wrapper to Marlos Machado's Linear Features for ALE

Other References

About

Releases

Packages

Languages

pierrelux/rlss2017

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning Summer School 2017

300+ Notebooks from McGill COMP-767, Intro to RL

Wrapper to Marlos Machado's Linear Features for ALE

Other References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages