This repository is the official implementation of "MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training"
Install the packages in requirements.txt:
pip install -r requirements.txt
We tested our code on Python 3.8.10 and PyTorch 2.1.0.
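If you want a quick sanity check of your environment, the optional Python snippet below (not part of the repository) prints the interpreter and PyTorch versions so you can compare them with the tested setup.

```python
# Optional sanity check (not part of the repository): compare the local environment
# with the tested setup (Python 3.8.10, PyTorch 2.1.0); nearby versions may also work.
import sys
import torch

print("Python :", sys.version.split()[0])    # expected: 3.8.x
print("PyTorch:", torch.__version__)         # expected: 2.1.0
print("CUDA available:", torch.cuda.is_available())
```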
Run the following scripts to download the dependency materials:
bash prepare/download_smpl_model.sh
bash prepare/prepare_clip.sh
bash prepare/download_t2m_evaluators.sh
Please refer to HumanML3D for text-to-motion dataset setup. Copy the resulting dataset to our repository:
cp -r ../HumanML3D/HumanML3D ./datasets/humanml3d
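Optionally, you can check that the copied dataset looks complete before training. The sketch below is not from this repository; it assumes the standard HumanML3D layout (new_joint_vecs/, texts/, Mean.npy, Std.npy and the split files), so adjust the names if your copy differs.

```python
# Optional check: verify ./datasets/humanml3d has the expected HumanML3D layout.
# The entries below assume the standard HumanML3D release; adjust if yours differs.
from pathlib import Path

root = Path("./datasets/humanml3d")
expected = ["new_joint_vecs", "texts", "Mean.npy", "Std.npy",
            "train.txt", "val.txt", "test.txt"]
for name in expected:
    path = root / name
    print(f"{'OK     ' if path.exists() else 'MISSING'} {path}")
```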
The training setup can be adjusted in the *.yaml config files in ./configs.
To train the stage 1 model, run the following script:
python train.py --cfg ./configs/config_stage1.yaml --cfg_assets ./configs/assets.yaml --nodebug
To train the stage 2 model, run the following script:
python train.py --cfg ./configs/config_stage2.yaml --cfg_assets ./configs/assets.yaml --nodebug
Please first set the trained model checkpoint path in TEST.CHECKPOINT in config_stage1.yaml and config_stage2.yaml.
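Optionally, before running evaluation you can confirm that TEST.CHECKPOINT is set and points at an existing file. The sketch below assumes the configs are plain YAML readable with OmegaConf; adapt it if the repository loads its configs differently.

```python
# Optional check (assumes plain YAML configs readable by OmegaConf):
# print TEST.CHECKPOINT from each config and verify the file exists.
from pathlib import Path
from omegaconf import OmegaConf

for cfg_path in ["./configs/config_stage1.yaml", "./configs/config_stage2.yaml"]:
    cfg = OmegaConf.load(cfg_path)
    ckpt = str(cfg.TEST.CHECKPOINT)
    status = "found" if Path(ckpt).exists() else "NOT FOUND"
    print(f"{cfg_path}: TEST.CHECKPOINT = {ckpt} ({status})")
```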
To evaluate the reconstruction performance of the stage 1 model, run the following command:
python test.py --cfg ./configs/config_stage1.yaml --cfg_assets ./configs/assets.yaml
To evaluate the motion generation performance of the stage 2 model, run the following command:
python test.py --cfg ./configs/config_stage2.yaml --cfg_assets ./configs/assets.yaml
We support a text file (for text-to-motion) and an npy file (for the control signal in motion editing) as input. The generated/edited motions are saved as npy files. A sketch for preparing your own inputs is shown after the commands below.
python visualize_test.py --cfg ./configs/config_stage2.yaml --cfg_assets ./configs/assets.yaml --example ./demo/example.txt
python visualize_test.py --cfg ./configs/config_stage2.yaml --cfg_assets ./configs/assets.yaml --example ./demo/example.txt --editing --control ./demo/control_example_start_end.npy
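If you want to prepare your own inputs, the sketch below shows one plausible way to write a prompt file and to inspect the provided control-signal example; the prompt format (one prompt per line) and the output path are assumptions, so compare with ./demo/example.txt and the files actually produced by visualize_test.py.

```python
# Hypothetical sketch for preparing inputs and inspecting outputs.
# The prompt-file format and output path are assumptions; compare with
# ./demo/example.txt and the npy files written by visualize_test.py.
import numpy as np

# 1) Text prompts for text-to-motion (assumed: one prompt per line, as in ./demo/example.txt).
with open("./demo/my_prompts.txt", "w") as f:
    f.write("a person walks forward and then turns around.\n")

# 2) Inspect the provided control-signal example to learn the expected shape/dtype.
control = np.load("./demo/control_example_start_end.npy", allow_pickle=True)
print("control signal:", getattr(control, "shape", None), getattr(control, "dtype", None))

# 3) Inspect a generated/edited motion (replace the path with an actual output file).
# motion = np.load("path/to/output_motion.npy")
# print("motion:", motion.shape)
```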
@article{uchida2024mola,
title={MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training},
author={Uchida, Kengo and Shibuya, Takashi and Takida, Yuhta and Murata, Naoki and Takahashi, Shusuke and Mitsufuji, Yuki},
journal={arXiv preprint arXiv:2406.01867},
year={2024}
}
Part of the code is borrowed from the following repositories. We would like to thank the authors of these repos for their excellent work: MLD, HumanML3D, MPGD, SAN, T2M-GPT.