robot_learning_project/paper_ideas.md at main · HansalShah007/robot_learning_project · GitHub

Initial attempts with Q-Learning and why it wasn't able to solve.
DQN unable to solve majority of time.
Changes to action space
Adding Reward Shaping
PerformanceBasedEpsilonCallback
Reasons to switch to A2c
Vanialla model performance vs parameter changes / wrappers