Separate inference 720p video with 24G VRAM #597

narrowsnap · 2024-07-10T05:17:27Z

Add a VAE encoder for reference inputs.

Reduce inference VRAM by running each stage as a separate process:

1. Run the text encoder and save the text embedding.
2. (Optional) Run the VAE encoder if the prompt contains reference_path.
3. Run STDiT with the saved text embedding and save the latents.
4. Run the VAE decoder with the saved latents.
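
The staged flow above can be sketched as a small driver that runs each stage as its own process and stops at the first failure. This is a minimal illustration, not the PR's script: `run_stages` is a hypothetical helper, and the `echo` commands are placeholders for the real stage scripts.

```bash
#!/usr/bin/env bash
# Sketch only: the stage commands passed in are placeholders, not the PR's scripts.

# Run each argument as its own process, in order. Because every stage is a
# separate process, its model is released from VRAM when the process exits.
run_stages() {
    for stage in "$@"; do
        echo "running: $stage" >&2
        # Stop at the first failure: later stages read files this stage saves.
        if ! sh -c "$stage"; then
            echo "stage failed, stopping: $stage" >&2
            return 1
        fi
    done
}

# Placeholder stages standing in for the text encoder, STDiT, and VAE decoder.
run_stages "echo text-encoder" "echo stdit" "echo vae-decoder"
```

Stopping early matters because each stage consumes a file written by the previous one. If a stage fails and the driver keeps going, the next stage reports a missing file rather than the real cause.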

FrankLeeeee · 2024-07-12T06:54:04Z

README.md

+### Separate Inference 720p video with 24G VRAM
+```bash
+# text to video
+./scripts/separate_inference.sh 4s 720p "9:16" "a beautiful waterfall"
+```
+
+```bash
+# image to video
+./scripts/separate_inference.sh 4s 720p "9:16" "a beautiful waterfall. {\"reference_path\": \"path2reference.png\",\"mask_strategy\": \"0\"}"
+```
+


I understand the motivation, but could you add more documentation explaining why, when, and how to run inference separately, so that other users have clearer guidance?

FrankLeeeee · 2024-07-12T07:02:08Z

scripts/separate_inference.sh

+
+set_default_params "$@"
+
+CUDA_VISIBLE_DEVICES=0,1 torchrun  --nproc_per_node 2 --master_port=23456 scripts/separate_inference/inference_text_encoder.py configs/opensora-v1-2/inference/sample.py --aes 7 --num-frames "$num_frames" --resolution "$resolution" --aspect-ratio "$aspect_ratio" --prompt "$prompt"


This uses 2 GPUs by default. Can you make the GPU selection configurable via a bash argument as well?
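
One possible shape for this, as a sketch rather than the change that was eventually made: take a comma-separated device list and derive `--nproc_per_node` from it. The names `count_gpus`, `build_launch_cmd`, and the `GPUS` environment variable are hypothetical; the `torchrun` flags and script paths are the ones already used in the script.

```bash
#!/usr/bin/env bash
# Hypothetical helper: count the devices in a comma-separated list such as "0,1,3".
count_gpus() {
    old_ifs="$IFS"
    IFS=','
    # Unquoted on purpose so the shell splits the list on commas.
    set -- $1
    IFS="$old_ifs"
    echo "$#"
}

# Hypothetical wrapper: print the torchrun launch line for a given device list.
# Remaining arguments (script, config, flags) are passed through unchanged.
build_launch_cmd() {
    device_list="$1"
    shift
    nproc="$(count_gpus "$device_list")"
    echo "CUDA_VISIBLE_DEVICES=$device_list torchrun --nproc_per_node $nproc --master_port=23456 $*"
}

# Example: fall back to the two devices the script currently hard-codes.
build_launch_cmd "${GPUS:-0,1}" scripts/separate_inference/inference_text_encoder.py configs/opensora-v1-2/inference/sample.py
```

Deriving the process count from the device list keeps the two values from drifting apart, which is the usual failure mode when they are set independently.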

tpc2233 · 2024-08-14T14:52:01Z

I tried your 24 GB VRAM code, @narrowsnap, but inference_stdit.py fails at the call that passes caption_embs=caption_embs, caption_emb_masks=caption_emb_masks, with AttributeError: 'NoneType' object has no attribute 'encode'. The rest of the steps seem OK.

narrowsnap · 2024-08-15T01:52:09Z

> caption_emb_masks

Have you updated the RFLOW code (opensora/schedulers/rf/__init__.py)?

tpc2233 · 2024-08-15T09:46:07Z

> Have you updated the RFLOW code (opensora/schedulers/rf/__init__.py)?

Yes, I even tried cloning your fork, but no luck. Is this the right file?
https://github.com/narrowsnap/Open-Sora/blob/main/opensora/schedulers/rf/__init__.py

narrowsnap · 2024-08-15T09:49:18Z

> Is this the right file? https://github.com/narrowsnap/Open-Sora/blob/main/opensora/schedulers/rf/__init__.py

This is wrong! You need to use the feature/720p_for_16g branch.

tpc2233 · 2024-08-15T09:54:46Z

> This is wrong! You need to use the feature/720p_for_16g branch.

Sorry, that is what I meant. Yes, I am using the file from that branch:
https://github.com/narrowsnap/Open-Sora/blob/feature/720p_for_16g/opensora/schedulers/rf/__init__.py

It fails with:
TypeError: sample() got an unexpected keyword argument 'caption_embs'
scripts/separate_inference/inference_stdit.py FAILED
which then results in:
FileNotFoundError: [Errno 2] No such file or directory: './samples/samples/2024-08-15/00002/0_0_latents.pt'

Also, many thanks for the quick replies, much appreciated.

narrowsnap · 2024-08-15T10:00:42Z

> It fails with: TypeError: sample() got an unexpected keyword argument 'caption_embs'

What command did you use?

tpc2233 · 2024-08-15T10:02:45Z

> What command did you use?

From the root of the fork, I am just running: bash ./scripts/separate_inference.sh

narrowsnap · 2024-08-15T11:00:43Z

> From the root of the fork, I am just running: bash ./scripts/separate_inference.sh

I can run it successfully. Given the error you showed, I suggest you check whether caption_embs is present in your code (opensora/schedulers/rf/__init__.py, line 45).

tpc2233 · 2024-08-15T14:10:17Z

> I can run it successfully. Given the error you showed, I suggest you check whether caption_embs is present in your code (opensora/schedulers/rf/__init__.py, line 45).

Got it working :) The solution was to delete all the installations and the conda env, and install only from your fork. After you said it was working, I tried deleting rf/__init__.py and still got the same error, so I think some kind of caching or a reference to the original installation was involved. After reinstalling everything, it worked. Many thanks for the quick replies and help, @narrowsnap. Great work.
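
For anyone who hits the same `unexpected keyword argument 'caption_embs'` error, a quick check like the following may show whether Python is importing a different installation than the fork. It is a sketch under assumptions: `check_opensora_install` is a hypothetical name, and it assumes the scheduler class in opensora/schedulers/rf/__init__.py is importable as `RFLOW` with a `sample` method, as the error message suggests.

```bash
#!/usr/bin/env bash
# Hypothetical helper: report which opensora installation Python resolves and
# whether the sample() it finds accepts the caption_embs keyword.
# Assumes the scheduler class is importable as opensora.schedulers.rf.RFLOW.
check_opensora_install() {
    python - <<'EOF'
import inspect

import opensora
from opensora.schedulers.rf import RFLOW

# If this path is outside the fork's checkout, an older install is shadowing it.
print("opensora imported from:", opensora.__file__)

params = inspect.signature(RFLOW.sample).parameters
print("sample() accepts caption_embs:", "caption_embs" in params)
EOF
}
```

Run `check_opensora_install` inside the environment used for inference. If the printed path points somewhere other than the fork's working tree, reinstalling from the fork in a clean environment, as described above, is the likely fix.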

Luke100000 · 2024-11-15T13:05:25Z

Hi, is it possible to reduce VRAM usage further to get it running on 12 GB? :)
Right now, the T5 encoder causes the highest spike. Running only the text encoder on CPU allows me to generate a 720p 3s video, and smaller generations would fit into 8 GB just fine as well.

Zhouyang added 3 commits July 10, 2024 13:13

separate inference to reduce VRAM

e58d7ec

add inference_vae_encoder for refenrece

4588b39

udpate readme

8dffcaa

FrankLeeeee reviewed Jul 12, 2024

FrankLeeeee mentioned this pull request Jul 12, 2024

Added cpu offload to enable full length 720p on a 4090. #546

Open

FrankLeeeee reviewed Jul 12, 2024

Zhouyang added 2 commits July 12, 2024 15:58

add explanation and more args

787a5ea

update Separate Inference

368d0b8
