Skip to content

Latest commit

 

History

History
51 lines (31 loc) · 977 Bytes

README.md

File metadata and controls

51 lines (31 loc) · 977 Bytes

BrunoSAC

Exchangeable Models in Meta Reinforcement Learning
I. Korshunova, J. Degrave, J. Dambre, A. Gretton, F. Huszár
Lifelong Learning Workshop at ICML 2020

Requirements

The code was used with the following settings:

  • python3
  • tensorflow-gpu==1.14.0
  • tensorflow-probability==0.7.0
  • gym==0.17.1
  • mujoco-py== 2.0.2.9
  • mujoco200

Training and testing

To train and then test BrunoSAC on Cheetah-Dir run:

python meta_cheetah_dir.py --train 
python meta_cheetah_dir.py --test

Similarly, for the oracle:

python meta_cheetah_dir.py --train --oracle
python meta_cheetah_dir.py --test --oracle

To plot the learning curves and test rewards:

python -m plots.plot_train_cheetah_dir 
python -m plots.plot_test_cheetah_dir

The same commands can be used with meta_cheetah_vel.py for the Cheetah-Vel experiments.

Questions?

Please send an email to [email protected], and I'll be happy to answer.