Skip to content

Latest commit

 

History

History
70 lines (51 loc) · 3.32 KB

README.md

File metadata and controls

70 lines (51 loc) · 3.32 KB

1. Realtime-MLOps: A framework of open-source technologies to design real-time ml systems.

Realtime MLOps provides a cloud-native approach to design machine learning systems enabling the use of real-time. This setup enables and demonstrates real-time feature engineering, online inferencing, real-time monitoring and continual learning as a a matured Kubernetes-native MLOps platform.

2. Summary

The platform is divided into five modules to enable different aspects of a real-time mlops system.

  1. Kubernetes is used in conjunction with Github to provide an Infrastructure-as-Code enabled way of managing the platform.
  2. Strimzi Kafka Operator and KNative Eventing are used to provide and abstract streaming capabilities in a Kubernetes-native way. Stream Processing is accomplished by using Bytewax Dataflows for various use-cases.
  3. Feast is used to create a feature store to centralize features and provide access to real-time and historical features. Feast is used to implement a redis online store and a postgresql offline store.
  4. To orchestrate modular ml pipelines tekton is used alongside with mlflow to store experiment metadata and model artifacts. Tekton enables reusable event triggered pipeline runs.
  5. Inferencing is done using KServe and Prometheus to allow model serving, model updating and model monitoring.

The following figure shows the frameworks composition of components and flow of information.

3. Quickstart

Download the repistory and create conda environment make env and activate conda activate realtime-mlops

Simply use make install to install all components and run the demonstration examples.

4. Access

Telepresence provides a convinient way to access internal Kubernetes services externally.

Web interfaces then are available at:

If the setup did run successfully but the web UIs are not available it may help to quit and connect telepresence client again.

telepresence quit
telepresence connect

5. Demonstration

Check out the demonstraion for a real-time ml use-case.

5.1. Real-time Feature Engineering

see Real-time Feature Engineering

5.2. Online Inferencing

see Online Inferencing

5.3. Continual Learning

see Continual Learning

5.4. Real-time Monitoring

see Real-time Monitoring