Pull requests: triton-inference-server/server
[build] Bumping vllm version to 0.7.0
build
Issues pertaining to builds
#7978
opened Jan 28, 2025 by
oandreeva-nv
6 of 20 tasks
ci: Increase batch size for L0_openai_trtllm test
#7975
opened Jan 28, 2025 by
krishung5
20 tasks
build: Dockerfiles generated by build.py for RHEL require base image specification
PR: build
Changes that affect the build system or external dependencies
#7970
opened Jan 27, 2025 by
saturley-hall
6 of 20 tasks
draft: feat: Add graceful shutdown timer to GRPC frontend
enhancement
New feature or request
grpc
Related to the GRPC server
#7969
opened Jan 27, 2025 by
mattwittwer
Draft
5 of 20 tasks
Separate model generation for backends on blackwell clusters
#7966
opened Jan 24, 2025 by
pvijayakrish
3 of 20 tasks
ci: Temporary disable KServe Python tests that may fail
PR: ci
Changes to our CI configuration files and scripts
#7949
opened Jan 17, 2025 by
kthui
9 of 20 tasks
docs: update to fix autoscaling example command
#7883
opened Dec 16, 2024 by
mattwittwer
Draft
20 tasks
feat: ORCA Format KV Cache Utilization in Inference Response Header
#7839
opened Nov 27, 2024 by
BenjaminBraunDev
12 of 22 tasks
refactor: Refactor of L0_backend_python and the env subtest
PR: ci
Changes to our CI configuration files and scripts
PR: refactor
A code change that neither fixes a bug nor adds a feature
#7838
opened Nov 27, 2024 by
nv-kmcgill53
Draft
5 of 20 tasks
ci: Enables testing for pull requests
#7828
opened Nov 23, 2024 by
pranavm-nvidia
3 of 20 tasks
test: Updates L0 Python API tests to run all test files
#7827
opened Nov 23, 2024 by
pranavm-nvidia
4 of 20 tasks
fix: Default max tokens to None for OpenAI frontend.
#7819
opened Nov 20, 2024 by
thealmightygrant
4 of 22 tasks
feat: Adding RestrictedFeatures Support to the Python Frontend Bindings
#7775
opened Nov 8, 2024 by
KrishnanPrash
docs: Add clarification for label_filename in classification docs
#7766
opened Nov 5, 2024 by
trevoryao
7 of 22 tasks
docs: Simplify PR templates
PR: docs
Documentation only changes
#7753
opened Oct 29, 2024 by
yinggeh
6 of 11 tasks
[Do not merge!] Build: Remove TRT model generation for V100
#7712
opened Oct 16, 2024 by
pvijayakrish
Draft
3 of 20 tasks
fix:Split L0_nomodel_perf into 2 test to ensure better debug-ability and resource util for PA
#7705
opened Oct 15, 2024 by
indrajit96
Draft
6 of 19 tasks
test: TC for Metric P0 nv_load_time per model
#7697
opened Oct 14, 2024 by
indrajit96
8 of 20 tasks
Build: Update TRT release branch referenced in model gen file
#7693
opened Oct 11, 2024 by
pvijayakrish
3 of 20 tasks
Build: Update README and versions for 24.10
#7686
opened Oct 8, 2024 by
pvijayakrish
3 of 20 tasks
[DO NOT MERGE] Build: Update Readme and versions for 24.09
#7607
opened Sep 7, 2024 by
pvijayakrish
Draft
3 of 20 tasks