Project 1: Mario

Algorithm: PPO, ~55 hours of training, ~2M frames, ~2K games played; gamma = 0.98. Reward function:

| Reward | Coefficient |
|---|---|
| score | 0.01 |
| life loss | -5 |
| move-to-the-right oracle | +0.01 |
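
As a rough illustration, the reward terms listed above could be combined per step as in the sketch below. This is only a guess at the shape of the computation, not the notebook's actual code: the `StepInfo` fields, the function name, and the choices to apply the score coefficient to the score gained since the previous step and to pay the oracle bonus whenever Mario's x position increases are all assumptions made for the example.

```python
from dataclasses import dataclass

# Coefficients taken from the reward list above.
SCORE_COEFF = 0.01
LIFE_LOSS_PENALTY = -5.0
MOVE_RIGHT_BONUS = 0.01


@dataclass
class StepInfo:
    """Hypothetical per-step game state; field names are illustrative."""
    score: int   # in-game score
    lives: int   # remaining lives
    x_pos: int   # horizontal position of Mario in the level


def shaped_reward(prev: StepInfo, curr: StepInfo) -> float:
    """Combine the three reward terms for one environment step."""
    # Assumption: the score term rewards points gained during this step.
    reward = SCORE_COEFF * (curr.score - prev.score)
    # Fixed penalty when a life is lost.
    if curr.lives < prev.lives:
        reward += LIFE_LOSS_PENALTY
    # Assumption: the "oracle" pays a small bonus for any progress to the right.
    if curr.x_pos > prev.x_pos:
        reward += MOVE_RIGHT_BONUS
    return reward
```

With these coefficients, a single lost life costs as much as 500 points of score, so under this reading the agent is pushed strongly toward survival.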

Results: the agent got stuck in the middle of level 2 (?!) because of a mushroom:

Fun fact: when training with the score reward, the average number of mushrooms per game rose to 0.8, up from 0.1 for a random policy. After the agent got stuck in this local optimum, the average dropped back to 0.1! No more eating mushrooms o_O

The reward has now plateaued; here is the last game played:

Good news: he can shoot turtles, which gives Mario a lot of points! Bad news: he is stuck again, this time because of a long pit :(

Project 2.1: Ant (PyBullet)

Algorithm: TD3, ~13 hours of training, ~2M frames, ~2K games played.

Rendering is still an issue, but the reward indicates that training worked.

Project 2.2: Bipedal Walker
