Skip to content

Commit

Permalink
Merge pull request #41 from igooch/readiness
Browse files Browse the repository at this point in the history
Adds StartUp and Readiness Probe to HuggingFace TGI
  • Loading branch information
zmerlynn authored Mar 14, 2024
2 parents c9c3a42 + 9a34104 commit 2f954e2
Showing 1 changed file with 36 additions and 0 deletions.
36 changes: 36 additions & 0 deletions genai/language/huggingface_tgi/k8s.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -31,6 +31,18 @@ spec:
ports:
- containerPort: 80
image: ghcr.io/huggingface/text-generation-inference:1.4.2
startupProbe:
httpGet:
path: /health
port: 80
failureThreshold: 240
periodSeconds: 5
readinessProbe:
httpGet:
path: /health
port: 80
failureThreshold: 12
periodSeconds: 5
# Use this image for Gemma support:
# image: us-docker.pkg.dev/vertex-ai/vertex-vision-model-garden-dockers/pytorch-hf-tgi-serve:20240220_0936_RC01
args:
Expand Down Expand Up @@ -113,6 +125,18 @@ spec:
ports:
- containerPort: 80
image: ghcr.io/huggingface/text-generation-inference:1.4.2
startupProbe:
httpGet:
path: /health
port: 80
failureThreshold: 240
periodSeconds: 5
readinessProbe:
httpGet:
path: /health
port: 80
failureThreshold: 12
periodSeconds: 5
# Use this image for Gemma support:
# image: us-docker.pkg.dev/vertex-ai/vertex-vision-model-garden-dockers/pytorch-hf-tgi-serve:20240220_0936_RC01
args:
Expand Down Expand Up @@ -195,6 +219,18 @@ spec:
ports:
- containerPort: 80
image: ghcr.io/huggingface/text-generation-inference:1.4.2
startupProbe:
httpGet:
path: /health
port: 80
failureThreshold: 240
periodSeconds: 5
readinessProbe:
httpGet:
path: /health
port: 80
failureThreshold: 12
periodSeconds: 5
# Use this image for Gemma support:
# image: us-docker.pkg.dev/vertex-ai/vertex-vision-model-garden-dockers/pytorch-hf-tgi-serve:20240220_0936_RC01
args:
Expand Down

0 comments on commit 2f954e2

Please sign in to comment.