Commit
Merge pull request #15 from TianyiQ/main
fix(evaluations): increase memory util in inference
TianyiQ authored Nov 16, 2024
2 parents c283c3c + 72ae548 commit 8da2220
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/abstractions/backends.py
@@ -316,7 +316,7 @@ def vllm_process_batch(
)

with open(os.devnull, "w") as devnull:
- frac_static = 0.8 if purpose == "responses" else 0.4
+ frac_static = 0.8 if purpose == "responses" else 0.7
prefill_size = 8192 if purpose == "responses" else 1024

model_size = get_model_size(model_repoid_or_path)
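
For reference, the two purpose-dependent settings in this hunk can be sketched in isolation as below. This is only an illustration of the post-commit values: `pick_static_settings` is a hypothetical helper that does not exist in `src/abstractions/backends.py`, and `"logprobs"` is a placeholder for any purpose other than `"responses"`. How `frac_static` and `prefill_size` are consumed afterwards is not visible in the diff, so it is not modeled here.

```python
def pick_static_settings(purpose: str) -> tuple[float, int]:
    """Hypothetical helper mirroring the two conditionals in the hunk above."""
    # After this commit: 0.8 for "responses", 0.7 (previously 0.4) otherwise.
    frac_static = 0.8 if purpose == "responses" else 0.7
    # Unchanged by this commit: 8192 for "responses", 1024 otherwise.
    prefill_size = 8192 if purpose == "responses" else 1024
    return frac_static, prefill_size


if __name__ == "__main__":
    # "logprobs" is a placeholder purpose, not a value confirmed by the source.
    for purpose in ("responses", "logprobs"):
        print(purpose, pick_static_settings(purpose))
```

Only the non-`"responses"` branch of `frac_static` changes; the `"responses"` branch and both `prefill_size` values are the same before and after the commit.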
