How much GPU memory required to run live streaming demo? #5

WangyiNTU · 2025-02-10T05:16:48Z

Thanks for your wonderful framework. May I know how much GPU memory is required to run a live-streaming demo? Do you have a small pre-trained model for low-memory inferencing? Thanks.

hyf015 · 2025-02-10T07:04:30Z

Hi, thank you for your interest in our work! If generation is not required, Vince needs 18G GPU memory. If the generation module is loaded, it will need extra 6G GPU memory, in total a little more than 24G.

In our experiments, we run Vinci for live-streaming demo on one 4090 GPU.

WangyiNTU · 2025-02-11T02:24:06Z

Hi, thank you for your interest in our work! If generation is not required, Vince needs 18G GPU memory. If the generation module is loaded, it will need extra 6G GPU memory, in total a little more than 24G.

In our experiments, we run Vinci for live-streaming demo on one 4090 GPU.

Thanks for your prompt reply. Is there an alternate solution if I want to run it on a GPU less than 18G? Like InternLM2.5-1.8B as fixed LLM?

hyf015 · 2025-02-17T02:40:05Z

Thank you for your suggestion! In fact, we originally used smaller models but the performance did not seem optimal. We will add this option to our code soon, within this week. I will notify you in this thread once it is done.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How much GPU memory required to run live streaming demo? #5

How much GPU memory required to run live streaming demo? #5

WangyiNTU commented Feb 10, 2025

hyf015 commented Feb 10, 2025

WangyiNTU commented Feb 11, 2025

hyf015 commented Feb 17, 2025

How much GPU memory required to run live streaming demo? #5

How much GPU memory required to run live streaming demo? #5

Comments

WangyiNTU commented Feb 10, 2025

hyf015 commented Feb 10, 2025

WangyiNTU commented Feb 11, 2025

hyf015 commented Feb 17, 2025