how to fine-tuning on a custom dataset #35

trkRingo · 2025-02-21T01:55:00Z

If I directly fine-tune based on the provided stage-3 pretrained weight, how many iterations and gpus are estimated to get good results? Are there any guidance or insights on parameter-efficient fine-tuning techniques?
In addition, how to reduce the vRAM requirement (48g)? The operations that have been tried: sp_size=8, enable vae_tiling, reduce image resolution and video frame.
Looking forward to your reply!

flymin · 2025-03-04T05:31:19Z

Our experiments on the Waymo dataset show that one may acquire usable results within 2000 iterations, starting from the stage 3 model, while more iterations further improve the quality and controllability.

The encoding may take too much memory on high resolution. If tiling does not help, you may try to generate latents offline and skip the VAE during training.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

how to fine-tuning on a custom dataset #35

how to fine-tuning on a custom dataset #35

trkRingo commented Feb 21, 2025

flymin commented Mar 4, 2025

how to fine-tuning on a custom dataset #35

how to fine-tuning on a custom dataset #35

Comments

trkRingo commented Feb 21, 2025

flymin commented Mar 4, 2025