ReinforcementLearning

To run the cartpole example first create a new conda environment and run

pip install -e .
python cartpole_pytorch.py

Added code using pytorch lightning based on https://pytorch-lightning.readthedocs.io/en/latest/notebooks/lightning_examples/reinforce-learning-DQN.html

To run this do

python cartpole_pytorch_lightning.py

The design using pytorch lightning is a not completely clear to me as of writing. For one I do not know why there is a target net and how the target for MSE loss is computed. Secondly, because of the way LightningModule naturally works, instead of episodes we have epochs. In the simple pytorch example we would do a new episode when the environment was done and that way we could control number of episodes. Here on the other hand an epoch continues beyond the environment registering a done so the number of episodes is not fixed but rather the global number of training steps.

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
GameTheory		GameTheory
NotesFromSuttonAndBarto		NotesFromSuttonAndBarto
openai		openai
plotting		plotting
CartPole Pytorch.ipynb		CartPole Pytorch.ipynb
MountainCar Plots.ipynb		MountainCar Plots.ipynb
README.md		README.md
RL cartpole.ipynb		RL cartpole.ipynb
best_model_weights.pth		best_model_weights.pth
cartpole_pytorch.py		cartpole_pytorch.py
cartpole_pytorch_lightning.py		cartpole_pytorch_lightning.py
model_weights.pth		model_weights.pth
mountain_car.py		mountain_car.py
requirements.txt		requirements.txt
setup.py		setup.py
target_model_weights.pth		target_model_weights.pth

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ReinforcementLearning

About

Releases

Packages

Languages

borundev/ReinforcementLearning

Folders and files

Latest commit

History

Repository files navigation

ReinforcementLearning

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages