Skip to content

Curated list of services and platforms for serverless GPU and AI inference

Notifications You must be signed in to change notification settings

viktorfa/awesome-serverless-gpu

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 

Repository files navigation

Awesome Serverless GPU

List of where to run code on GPUs for AI, inference, predictions that are serverless.
Serverless is defined as pay-as-you-go, scale-to-zero, minimal infrastructure configuration.

Serverless GPU is a reletively new and fast evolving field. New services are appearing and disappearing frequently.
I will do my best to keep the list updated, and soon include benchmarks.

Common weaknessses of serverless GPU at the moment is very long cold starts, and configuration that are less easy to use than the more mature field of serverless on CPUs.

Inference

Bring your own model

True serverless inference

Predefined models

True serverless with a limited set of models

Not serverless inference

Needs dedicated server, but works with your own model

Dev on GPUs

Flexible on-demand GPU providers

Predefined models over API

Speech to text

Text to speech

Image generation

Workflow platforms

AI Agents for Websites

About

Curated list of services and platforms for serverless GPU and AI inference

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published