Configuring localGPT for production #472
Unanswered
AnandMoorthy asked this question in Q&A
Replies: 3 comments
-
@PromtEngineer Need your input on this!
-
@PromtEngineer My team and I are running into the same issue.
-
@AnandMoorthy @matheus-mondaini We will need to implement a queue in the API to handle multiple users; it should be relatively easy to implement. I will have a look. Getting back to this project soon.
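A minimal sketch of the queue idea mentioned above: a single background worker drains a `queue.Queue` so that GPU-bound inference calls are served one at a time, no matter how many Flask request threads arrive concurrently. This is an illustration, not localGPT's actual code; `run_inference` is a hypothetical stand-in for the real model call, and the class name is invented here.

```python
import queue
import threading
from concurrent.futures import Future

class InferenceQueue:
    """Serialize calls to a GPU-bound function behind one worker thread."""

    def __init__(self, run_inference):
        # run_inference: placeholder for the real (non-thread-safe) model call.
        self._run = run_inference
        self._jobs = queue.Queue()
        self._worker = threading.Thread(target=self._loop, daemon=True)
        self._worker.start()

    def _loop(self):
        # Only this thread ever touches the model, so requests never
        # overlap on the GPU.
        while True:
            prompt, fut = self._jobs.get()
            try:
                fut.set_result(self._run(prompt))
            except Exception as exc:
                fut.set_exception(exc)
            finally:
                self._jobs.task_done()

    def submit(self, prompt, timeout=None):
        # Called from each Flask request handler; blocks until this
        # request's turn on the GPU has completed.
        fut = Future()
        self._jobs.put((prompt, fut))
        return fut.result(timeout=timeout)

if __name__ == "__main__":
    # Example with a fake model in place of the real one.
    q = InferenceQueue(lambda p: f"answer to {p!r}")
    print(q.submit("hello"))
```

In a Flask handler this would look like `answer = inference_queue.submit(request.json["prompt"], timeout=120)`, so concurrent users wait their turn instead of crashing the app; a timeout keeps clients from hanging forever if the queue backs up.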
-
Hi,
I am planning to deploy the project to production and expect around 10 people to use it concurrently. My current setup is an RTX 4090 with 24 GB of memory. The Flask app works fine when a single user is using localGPT, but it crashes when multiple requests come in at the same time.
I also see GPU utilization hit 100% whenever a request comes in. Is there any way, with the current configuration, to serve around 10 people concurrently? Other suggestions are welcome too :)
Thanks!