The repository contains the code for running the experiments on sparse reward environments in 2D-Gridworld, based on the paper Reward Propagation using Graph Convolutional Networks using the Proto Value Functions by Mahadevan and Maggioni as features to the GCN. The underlying MDP of the Gridworld is captured as a graph whihc is then used to calculate the Proto Value Functions.The implementation is GCN is baseed on Thomas Kipf's pytorch GCN implementation. The environment currently is only a GridWorld and all the results have been produced using this environment. The actor critic network implementation was not from any library but our own implementation using linear function approximators.
For a quick start clone the repository, and type the following command.
$ git clone <repo link>
$ python main.py
# PyTorch
conda install pytorch torchvision -c soumith
# Other requirements
pip install -r requirements.txt
#Installing PyGCN
python setup_gcn.py install
python main.py --env_dim 5 5 --gcn_epochs 100 --gcn_lambda 10 --gcn_alpha 0.6 --episodes 2000
-
Alisetti Sai Vamsi - Vamsi995
-
Raswanth Murugan - RaswanthMurugan20
- Dr.Chandra Shekar Lakshminarayan, IIT Palakkad