adp_rl

Approximate Dynamic Programming and Reinforcement Learning - Programming Assignment

The purpose of this assignment is to implement a simple environment and learn to make optimal decisions inside a maze by solving the problem with Dynamic Programming. Value Iteration(VI) and Policy Iteration(PI) i.e. Policy Evaluation, Policy Improvement methods are implemented and analyzed.

Run the python main.py /absolute/path/to/maze.txt command to launch the application.