Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how to fine-tuning on a custom dataset #35

Open
trkRingo opened this issue Feb 21, 2025 · 1 comment
Open

how to fine-tuning on a custom dataset #35

trkRingo opened this issue Feb 21, 2025 · 1 comment

Comments

@trkRingo
Copy link

  1. If I directly fine-tune based on the provided stage-3 pretrained weight, how many iterations and gpus are estimated to get good results? Are there any guidance or insights on parameter-efficient fine-tuning techniques?
  2. In addition, how to reduce the vRAM requirement (48g)? The operations that have been tried: sp_size=8, enable vae_tiling, reduce image resolution and video frame.
    Looking forward to your reply!
@flymin
Copy link
Owner

flymin commented Mar 4, 2025

Our experiments on the Waymo dataset show that one may acquire usable results within 2000 iterations, starting from the stage 3 model, while more iterations further improve the quality and controllability.

The encoding may take too much memory on high resolution. If tiling does not help, you may try to generate latents offline and skip the VAE during training.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants