Skip to content

Commit

Permalink
added changes from pr 245
Browse files Browse the repository at this point in the history
  • Loading branch information
tthakkal committed Nov 12, 2024
1 parent c345c73 commit 0346e15
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions server/text_generation_server/models/causal_lm.py
Original file line number Diff line number Diff line change
Expand Up @@ -497,6 +497,8 @@ def from_pb(

max_new_tokens = max(r.stopping_criteria.max_new_tokens for r in requests)
max_input_length = max_truncation
if max_input_length < PAD_SEQUENCE_TO_MULTIPLE_OF:
max_input_length = PAD_SEQUENCE_TO_MULTIPLE_OF
# TODO: by tokenizing all inputs at once we loose information on actual input lengths
# this means that we cannot shift inputs to the left after a long input sequence
# was filtered out
Expand Down

0 comments on commit 0346e15

Please sign in to comment.