Deep Multi-Agent Reinforcement Learning

AY2020/21 Sem 1/2 CP3209 UROP in Computing Project with Dr Jing Wei, IHPC.

To-do list

Get models working for speaker_listener, followed by the rest of the scenarios
Add discrete action space output option via Gumbel-Softmax reparameterization trick
Move noise parameter to inside the agent class
Add support for individual good/bad agent policies
Implement M3DDPG algorithm
Implement GIF saving for MPE
Implement policy estimation and ensembling for MADDPG
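
The Gumbel-Softmax item in the to-do list can be illustrated with a minimal NumPy sketch. The README does not say which deep-learning framework the project uses, so this covers only the forward sampling step, with no gradient handling. The names `gumbel_softmax_sample` and `to_one_hot` and the `temperature` parameter are assumptions made for illustration, not part of the repository.

```python
import numpy as np


def gumbel_softmax_sample(logits, temperature=1.0, rng=None):
    """Draw a relaxed (soft) categorical sample from unnormalized logits.

    Hypothetical helper for illustration only; not taken from the repository.
    """
    rng = np.random.default_rng() if rng is None else rng
    logits = np.asarray(logits, dtype=np.float64)

    # Gumbel(0, 1) noise via inverse transform: g = -log(-log(u)), u ~ U(0, 1).
    # The lower bound keeps log(u) finite.
    u = rng.uniform(low=1e-12, high=1.0, size=logits.shape)
    gumbel_noise = -np.log(-np.log(u))

    # Perturb the logits, scale by the temperature, then apply a
    # numerically stable softmax over the last axis.
    y = (logits + gumbel_noise) / temperature
    y = y - y.max(axis=-1, keepdims=True)
    exp_y = np.exp(y)
    return exp_y / exp_y.sum(axis=-1, keepdims=True)


def to_one_hot(soft_sample):
    """Discretize a soft sample into a one-hot action vector (argmax)."""
    soft_sample = np.asarray(soft_sample)
    n_actions = soft_sample.shape[-1]
    return np.eye(n_actions)[soft_sample.argmax(axis=-1)]


# Example: a batch of 2 agents, each choosing among 5 discrete actions.
example_logits = np.zeros((2, 5))
soft = gumbel_softmax_sample(example_logits, temperature=0.5,
                             rng=np.random.default_rng(0))
hard = to_one_hot(soft)
```

Lower temperatures push the soft sample closer to one-hot, at the cost of higher-variance gradients once this is ported to an autodiff framework. In training, the usual approach is to use the one-hot vector in the forward pass and the soft sample for the backward pass (the straight-through estimator), which this NumPy sketch does not attempt.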
