Training GPU requirement #50

Open
Sm0kyWu opened this issue Jun 11, 2024 · 3 comments

Comments

@Sm0kyWu
Sm0kyWu commented Jun 11, 2024

Hi! Thanks for the amazing code!

I would like to ask about the training requirements. I am currently using a single A100 with 40 GB of memory, and my training code follows Moore. No matter what video size I use (I even tried 64x64), training runs out of memory.

Could you please kindly share some information about training?

Thanks!

@TZYSJTU
Contributor

TZYSJTU commented Jun 12, 2024

8x 80 GB GPUs, DeepSpeed ZeRO stage 2.
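
For readers unfamiliar with the setup named above, here is a minimal sketch of what a DeepSpeed ZeRO stage 2 configuration can look like. Only the ZeRO stage comes from this thread; the batch size, gradient accumulation, and mixed-precision settings are assumptions for illustration, and the file name `ds_zero2.json` is hypothetical. It is not the maintainers' actual configuration.

```python
import json

# Sketch of a DeepSpeed config using ZeRO stage 2.
# Only "stage": 2 is taken from the thread; every other value is an assumption.
ds_config = {
    "train_micro_batch_size_per_gpu": 1,  # assumption, not stated in the thread
    "gradient_accumulation_steps": 1,     # assumption
    "fp16": {"enabled": True},            # assumption: mixed-precision training
    "zero_optimization": {
        "stage": 2,                    # shard optimizer states and gradients across GPUs
        "overlap_comm": True,          # overlap gradient reduction with the backward pass
        "contiguous_gradients": True,  # reduce memory fragmentation during backward
    },
}


def write_config(path="ds_zero2.json"):
    """Write the config to a JSON file (hypothetical file name) and return the path."""
    with open(path, "w") as f:
        json.dump(ds_config, f, indent=2)
    return path
```

ZeRO stage 2 partitions optimizer states and gradients across the data-parallel GPUs, so its memory savings grow with the number of GPUs; on a single GPU there is nothing to partition across unless CPU offload is also enabled.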

@FangSen9000

@Sm0kyWu If you use the Moore training code, does it need any modification, and approximately how much extra time does it take?

@Sm0kyWu
Author

Sm0kyWu commented Jun 23, 2024

@FangSen9000 I haven't tried training the Moore code itself. For MusePose you can use the training script from Moore directly. On a single 40 GB A100, stage 1 takes around 40 hours for 10,000 steps (batch size 6, 768x768, DeepSpeed ZeRO-2).
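
To put those figures in perspective, the per-step time and throughput can be worked out directly. This is a small sketch using only the numbers reported above (40 hours, 10,000 steps, batch size 6); the function name `step_stats` is made up for illustration.

```python
def step_stats(total_hours, steps, batch_size):
    """Return (seconds per step, samples per second) for a training run."""
    seconds_per_step = total_hours * 3600 / steps
    samples_per_second = batch_size / seconds_per_step
    return seconds_per_step, samples_per_second


# Figures reported in the comment above: 40 hours, 10,000 steps, batch size 6.
sec_per_step, samples_per_sec = step_stats(40, 10_000, 6)
print(f"{sec_per_step:.1f} s/step, {samples_per_sec:.2f} samples/s")
# prints "14.4 s/step, 0.42 samples/s"
```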
