Name		Name	Last commit message	Last commit date
parent directory ..
bert-sample		bert-sample
docs		docs
logger		logger
torchserve-image		torchserve-image
README.md		README.md
autoscale.yaml		autoscale.yaml
canary.yaml		canary.yaml
custom-server-with-external-storage.md		custom-server-with-external-storage.md
gpu.yaml		gpu.yaml
metrics.yaml		metrics.yaml
pv.yaml		pv.yaml
pvc.yaml		pvc.yaml
pvpod.yaml		pvpod.yaml
torchserve-custom-pv.yaml		torchserve-custom-pv.yaml
torchserve-custom.yaml		torchserve-custom.yaml

README.md

Predict on a InferenceService using a Custom Torchserve Image

In this example we use torchserve as custom server to serve an mnist model. The idea of using torchserve as custom server is to make the transistion for new users from torchserve to kfserving easier.

Setup

Your ~/.kube/config should point to a cluster with KFServing installed.
Your cluster's Istio Ingress gateway must be network accessible.

This example requires v1beta1/KFS 0.5

Build and push the sample Docker Image

The custom torchserve image is wrapped with model inside the container and serves it with KFServing.

In this example we build a torchserve image with marfile and config.properties into a container. To build and push with Docker Hub, run these commands replacing {username} with your Docker Hub username:

Refer steps for building and publishing docker image.

Create the InferenceService

In the torchserve-custom.yaml file edit the container image and replace {username} with your Docker Hub username.

Apply the CRD

kubectl apply -f torchserve-custom.yaml

Expected Output

$inferenceservice.serving.kubeflow.org/torchserve-custom created

Run a prediction

The first step is to determine the ingress IP and ports and set INGRESS_HOST and INGRESS_PORT

Download input image:

wget https://raw.githubusercontent.com/pytorch/serve/master/examples/image_classifier/mnist/test_data/0.png

MODEL_NAME=torchserve-custom
SERVICE_HOSTNAME=$(kubectl get inferenceservice ${MODEL_NAME} -n <namespace> -o jsonpath='{.status.url}' | cut -d "/" -f 3)

curl -v -H "Host: ${SERVICE_HOSTNAME}" http://${INGRESS_HOST}:${INGRESS_PORT}/predictions/mnist -T 0.png

Expected Output

*   Trying 52.89.19.61...
* Connected to a881f5a8c676a41edbccdb0a394a80d6-2069247558.us-west-2.elb.amazonaws.com (52.89.19.61) port 80 (#0)
> PUT /predictions/mnist HTTP/1.1
> Host: torchserve-custom.kfserving-test.example.com
> User-Agent: curl/7.47.0
> Accept: */*
> Content-Length: 272
> Expect: 100-continue
>
< HTTP/1.1 100 Continue
* We are completely uploaded and fine
< HTTP/1.1 200 OK
< cache-control: no-cache; no-store, must-revalidate, private
< content-length: 1
< date: Fri, 23 Oct 2020 13:01:09 GMT
< expires: Thu, 01 Jan 1970 00:00:00 UTC
< pragma: no-cache
< x-request-id: 8881f2b9-462e-4e2d-972f-90b4eb083e53
< x-envoy-upstream-service-time: 5018
< server: istio-envoy
<
* Connection #0 to host a881f5a8c676a41edbccdb0a394a80d6-2069247558.us-west-2.elb.amazonaws.com left intact
0

For Autoscaling

Configurations for autoscaling pods Auto scaling

Canary Rollout

Configurations for canary Canary Deployment

Log aggregation

Follow the link for torchserve log aggregation in kubernetes. Log aggregation with EFK Stack

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

torchserve

torchserve

README.md

Predict on a InferenceService using a Custom Torchserve Image

Setup

This example requires v1beta1/KFS 0.5

Build and push the sample Docker Image

Create the InferenceService

Run a prediction

For Autoscaling

Canary Rollout

Log aggregation

Files

torchserve

Directory actions

More options

Directory actions

More options

Latest commit

History

torchserve

Folders and files

parent directory

README.md

Predict on a InferenceService using a Custom Torchserve Image

Setup

This example requires v1beta1/KFS 0.5

Build and push the sample Docker Image

Create the InferenceService

Run a prediction

For Autoscaling

Canary Rollout

Log aggregation