Skip to content

Commit

Permalink
Update helm-charts/chatqna/gaudi-values.yaml
Browse files Browse the repository at this point in the history
Co-authored-by: Eero Tamminen <[email protected]>
  • Loading branch information
yongfengdu and eero-t authored Nov 8, 2024
1 parent 0cf5dfc commit 01bfa66
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions helm-charts/chatqna/gaudi-values.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -13,6 +13,7 @@ tgi:
resources:
limits:
habana.ai/gaudi: 1
# higher limits are needed with extra input tokens added by rerank
MAX_INPUT_LENGTH: "2048"
MAX_TOTAL_TOKENS: "4096"
CUDA_GRAPHS: ""
Expand Down

0 comments on commit 01bfa66

Please sign in to comment.