GitHub - lafmdp/TALAR: [NeurIPS'23] Official code for "Natural Language-conditioned Reinforcement Learning with Task-related Language Development and Translation", NeurIPS 2023.

Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation

This is the official implementation of TALAR.

Setup

Complete the following steps to install the conda environment and the required Python packages:
- Create the conda environment
```
conda create --name <env_name> --file spec-file.txt
```
- Install the Python packages
```
pip install -r requirements.txt
```
The environments used in this work depend on MuJoCo, the CLEVR-Robot Environment, and BERT. Set them up by following these instructions:
- Instructions for MuJoCo: https://mujoco.org/
- Instructions for CLEVR-Robot Environment: https://github.com/google-research/clevr_robot_env
- Instructions for BERT: https://huggingface.co/bert-base-uncased. Move the downloaded BERT model to the models directory.
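
Before training, it can help to confirm that the BERT files ended up where the code will look for them. The helper below is a minimal sketch and not part of TALAR: the function name `missing_bert_files` and the example path `models/bert-base-uncased` are hypothetical, and the file names are the ones published in the `bert-base-uncased` repository on Hugging Face.

```python
from pathlib import Path

# Files published in the bert-base-uncased repository on Hugging Face.
REQUIRED_FILES = ("config.json", "vocab.txt")
# Either weight format is enough to load the model.
WEIGHT_FILES = ("pytorch_model.bin", "model.safetensors")


def missing_bert_files(model_dir):
    """Return the names of expected BERT files that are absent from model_dir."""
    model_dir = Path(model_dir)
    missing = [name for name in REQUIRED_FILES if not (model_dir / name).is_file()]
    if not any((model_dir / name).is_file() for name in WEIGHT_FILES):
        missing.append(" or ".join(WEIGHT_FILES))
    return missing


if __name__ == "__main__":
    # Hypothetical location; adjust to wherever the BERT model was moved.
    problems = missing_bert_files("models/bert-base-uncased")
    print("BERT model looks complete" if not problems else f"Missing: {problems}")
```

An empty list means the directory contains a config, a vocabulary, and at least one weight file; whether TALAR expects exactly this sub-directory name is an assumption to check against the training scripts.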

Usage

Our full dataset is available at https://drive.google.com/drive/folders/1p1r5swySbafnUVfAfOCQXZiuXF3D2s3F?usp=share_link. Download it before using TALAR, or collect your own data.

Training the generator of TALAR in the kitchen environment:

python train_tl_kitchen.py

Training the translator of TALAR:

python train_translator_kitchen.py --path <path_to_model> --cpt-epoch 0

Training the goal-conditioned policy of TALAR:

- Finish training the generator and the translator of TALAR.
- Move the translator model directory to code/models and rename it to code/models/policy.

python train_gcp.py

- The goal-conditioned policy models are saved in gcp_model.
- The TensorBoard logs of the goal-conditioned policy are saved in gcp_train.
- The evaluation results of the goal-conditioned policy are saved in gcp_callback.
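
The three training stages run in a fixed order, with one manual move between the translator stage and the policy stage. The sketch below lays that sequence out under stated assumptions: the helper names `build_commands` and `install_translator` are hypothetical, the script names and flags are copied from the commands above, and the `<path_to_model>` placeholder is left for you to fill in. The `code/models/policy` target is taken literally from the step above.

```python
import shutil
import sys
from pathlib import Path


def build_commands(model_path, python=sys.executable):
    """Return the three training commands in the order described above.

    model_path is forwarded to --path; the README shows it only as
    <path_to_model>, so which model it should point to is not assumed here.
    """
    return [
        [python, "train_tl_kitchen.py"],
        [python, "train_translator_kitchen.py",
         "--path", str(model_path), "--cpt-epoch", "0"],
        [python, "train_gcp.py"],
    ]


def install_translator(translator_dir, models_dir="code/models"):
    """Move the trained translator directory to <models_dir>/policy."""
    target = Path(models_dir) / "policy"
    if target.exists():
        raise FileExistsError(f"{target} already exists; remove or rename it first")
    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.move(str(translator_dir), str(target))
    return target


if __name__ == "__main__":
    # Dry run: print the commands instead of launching training.
    for cmd in build_commands("<path_to_model>"):
        print(" ".join(cmd))
```

To actually run a stage, pass one of the returned commands to `subprocess.run(cmd, check=True)` from the repository root, and call `install_translator` after the translator stage finishes and before the policy stage starts.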

Citation

@inproceedings{talar/neurips/pang,
    author    = {Jing-Cheng Pang and Xinyu Yang and Si-Hang Yang and Xiong-Hui Chen and Yang Yu},
    title     = {Natural Language-conditioned Reinforcement Learning with Task-related Language Development and Translation},
    booktitle = {Advances in Neural Information Processing Systems,
                {NeurIPS}},
    year      = {2023}}

