Skip to content

Vamsi995/Reward-Propagation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reward Progpagation using Graph Convolutional Networks in GridWorld

The repository contains the code for running the experiments on sparse reward environments in 2D-Gridworld, based on the paper Reward Propagation using Graph Convolutional Networks using the Proto Value Functions by Mahadevan and Maggioni as features to the GCN. The underlying MDP of the Gridworld is captured as a graph whihc is then used to calculate the Proto Value Functions.The implementation is GCN is baseed on Thomas Kipf's pytorch GCN implementation. The environment currently is only a GridWorld and all the results have been produced using this environment. The actor critic network implementation was not from any library but our own implementation using linear function approximators.

Getting Started

For a quick start clone the repository, and type the following command.

$ git clone <repo link>
$ python main.py 

Installation

# PyTorch
conda install pytorch torchvision -c soumith

# Other requirements
pip install -r requirements.txt

#Installing PyGCN
python setup_gcn.py install

Usage

For GridWorld Implementation

python main.py --env_dim 5 5 --gcn_epochs 100 --gcn_lambda 10 --gcn_alpha 0.6 --episodes 2000

Results

Reward Propagation Regret Plot Loss Plot

Built With

Authors

Acknowledgments

  • Dr.Chandra Shekar Lakshminarayan, IIT Palakkad

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages