3D VAE finetune #111

Simona0212 · 2024-08-11T14:09:51Z

我们的科研项目想要在3D VAE的基础上进行改动和微调。我们尝试在原有模块的训练代码上进行修改，发现有点困难。
请问能否开源3D VAE模块的训练代码和配置文件？
感谢CogVideoX团队！

Simona0212 · 2024-08-11T14:53:50Z

I am going to modify and finetune the 3D VAE for our research project. We have tried to modify the training code of the original module and found it a little difficult. I would highly appreciate if you could provide the code and configuration files required to train and finetune the 3D VAE module (for video encoding and video decoding). We look forward to receiving feedback from your team. Thank you very much!

zRzRzRzRzRzRzR · 2024-08-30T06:02:17Z

Fine-tuning the VAE alone doesn’t seem very meaningful. If your goal is to fine-tune in conjunction with Transformers while reducing memory usage in the VAE encoder, we will continue working on this optimization. You can check for updates here.
#194 huggingface/diffusers#9302

serend1p1ty · 2024-10-11T08:23:43Z

@Simona0212 I recommend you use Open-Sora-Plan's code. I have trained a stronger VAE than CogVideoX using Open-Sora-Plan's codebase, along with many tricks.

vanche · 2024-11-18T04:04:13Z

@serend1p1ty I'm really interested in the tricks you've used and how much you've improved the VAE's performance. Could you share more details about the techniques you applied and the results you achieved?

serend1p1ty · 2024-11-19T03:18:20Z

@vanche Sorry I cannot share more tech detail. Here is my results.

There is still a lot of room for improvement (loss has not converged).

vanche · 2024-11-19T06:03:08Z

@serend1p1ty Thank you for sharing the performance results. Do you plan to publish a paper or technical report?

serend1p1ty · 2024-11-27T03:36:40Z

@vanche Basically, it's just some increment improvements, and unable to support a paper.

zRzRzRzRzRzRzR assigned tengjiayan20 Aug 11, 2024

zRzRzRzRzRzRzR mentioned this issue Aug 28, 2024

Work plan and enhancement / 工作计划和用户诉求 #194

Open

zRzRzRzRzRzRzR closed this as completed Sep 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

3D VAE finetune #111

3D VAE finetune #111

Simona0212 commented Aug 11, 2024 •

edited

Loading

Simona0212 commented Aug 11, 2024 •

edited

Loading

zRzRzRzRzRzRzR commented Aug 30, 2024

serend1p1ty commented Oct 11, 2024 •

edited

Loading

vanche commented Nov 18, 2024

serend1p1ty commented Nov 19, 2024

vanche commented Nov 19, 2024

serend1p1ty commented Nov 27, 2024

3D VAE finetune #111

3D VAE finetune #111

Comments

Simona0212 commented Aug 11, 2024 • edited Loading

Simona0212 commented Aug 11, 2024 • edited Loading

zRzRzRzRzRzRzR commented Aug 30, 2024

serend1p1ty commented Oct 11, 2024 • edited Loading

vanche commented Nov 18, 2024

serend1p1ty commented Nov 19, 2024

vanche commented Nov 19, 2024

serend1p1ty commented Nov 27, 2024

Simona0212 commented Aug 11, 2024 •

edited

Loading

Simona0212 commented Aug 11, 2024 •

edited

Loading

serend1p1ty commented Oct 11, 2024 •

edited

Loading