Skip to content
/ kubedl Public
forked from kubedl-io/kubedl

A unified operator for running deep learning/machine learning workloads on Kubernetes

License

Notifications You must be signed in to change notification settings

yhalpha/kubedl

 
 

License FOSSA Status KubeDL Action Status CII Best Practices


KubeDL enables deep learning workloads to run on Kubernetes more easily and efficiently.

KubeDL is a CNCF sandbox project.


Features

  • Support training and inferences workloads (Tensorflow, Pytorch. Mars etc.)in a single unified controller. Features include advanced scheduling, acceleration using cache, metadata persistentcy, file sync, enable service discovery for training in host network etc.
  • Automatically tunes the best container-level configurations before an ML model is deployed as inference services. - Morphling Github
  • Model lineage and versioning to track the history of a model natively in CRD: when the model is trained using which data and which image, each version of the model, which version is running etc.
  • Enables storing and versioning a model leveraging container images. Each model version is stored as its own image and can later be served with Serving framework.

Check the website: https://kubedl.io


Publications

KubeDL-Morphling paper accepted at ACM Socc 2021: Morphling: Fast, Near-Optimal Auto-Configuration for Cloud-Native Model Serving

License

FOSSA Status

About

A unified operator for running deep learning/machine learning workloads on Kubernetes

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Go 73.3%
  • JavaScript 24.4%
  • Less 1.1%
  • Python 0.3%
  • EJS 0.3%
  • Makefile 0.2%
  • Other 0.4%