Flattened inf params #152
Conversation
Signed-off-by: gkumbhat <[email protected]>
This looks great! Some initial thoughts, but mostly just discussion points
top_p: Optional[float] = 0.0,
typical_p: Optional[float] = 0.0,
temperature: Optional[float] = 1.0,
repetition_penalty: Optional[float] = 0.0,
The same comments apply to repetition_penalty and top_p here.
truncation = True

if repetition_penalty == 0.0:
    repetition_penalty = 1.0
Ah, I see now - but why is this being overridden here rather than just using 1 as the default in .run?
Is top_p supposed to be handled outside of generate like this too?
This was to align with TGIS, per the comments in the proto: https://github.com/caikit/caikit-tgis-backend/blob/main/caikit_tgis_backend/generation.proto#L103
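To make the override above concrete, here is a minimal sketch of the sentinel convention, assuming (as the TGIS proto comments suggest) that 0.0 means "unset" and that HF transformers' generate() treats 1.0 as "no penalty":

```python
def resolve_repetition_penalty(repetition_penalty: float = 0.0) -> float:
    """Map the TGIS-style 0.0 sentinel ("unset") to 1.0, which
    HF transformers' generate() treats as "no penalty"."""
    return 1.0 if repetition_penalty == 0.0 else repetition_penalty
```

Keeping 0.0 as the public default preserves wire compatibility with TGIS, while the mapping ensures generate() never sees an invalid penalty of 0.0.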
typical_p=0.23,
temperature=0.77,
)
assert isinstance(pred, GeneratedTextResult)
It might be nice to add some kind of validation that traces the input args passed to generate, since these tests aren't actually verifying that greedy / sampling decoding is happening.
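One way to do that kind of tracing is to mock the underlying model and assert on the kwargs that reach generate(). This is only a sketch; FakeModule and its run() signature are stand-ins for the real module under test:

```python
from unittest.mock import MagicMock

class FakeModule:
    """Hypothetical stand-in for the module under test; the real module
    would forward these kwargs to its underlying generate() call."""

    def __init__(self, model):
        self.model = model

    def run(self, text, typical_p=0.0, temperature=1.0):
        return self.model.generate(text, typical_p=typical_p, temperature=temperature)

mock_model = MagicMock()
module = FakeModule(mock_model)
module.run("hello", typical_p=0.23, temperature=0.77)

# Verify the sampling args actually reached generate()
mock_model.generate.assert_called_once_with("hello", typical_p=0.23, temperature=0.77)
```

With this pattern the test fails if the module silently drops or rewrites a sampling parameter, which the isinstance assertion alone would not catch.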
error.type_check("<NLP84635843E>", int, allow_none=True, top_k=top_k)
error.type_check("<NLP55267523E>", float, allow_none=True, top_p=top_p)
error.type_check("<NLP13670202E>", float, allow_none=True, typical_p=typical_p)
error.type_check(
Type check for decoding_method, temperature, max_time, and exponential_decay_length_penalty? And a value check on decoding_method? Something like this?
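A plain-Python sketch of the checks being requested (the allowed decoding_method values and the exact error messages are assumptions; the real code would presumably use the caikit error handler's type_check/value_check pattern shown above rather than raw raises):

```python
from typing import Optional, Tuple

# Assumed allowed values, mirroring common GREEDY/SAMPLING decoding options
VALID_DECODING_METHODS = {"GREEDY", "SAMPLING"}

def validate_inference_params(
    decoding_method: str,
    temperature: Optional[float] = None,
    max_time: Optional[float] = None,
    exponential_decay_length_penalty: Optional[Tuple[int, float]] = None,
) -> None:
    # Type checks, analogous to the error.type_check calls above
    if not isinstance(decoding_method, str):
        raise TypeError("decoding_method must be a str")
    if temperature is not None and not isinstance(temperature, float):
        raise TypeError("temperature must be a float")
    if max_time is not None and not isinstance(max_time, float):
        raise TypeError("max_time must be a float")
    if exponential_decay_length_penalty is not None and not isinstance(
        exponential_decay_length_penalty, tuple
    ):
        raise TypeError("exponential_decay_length_penalty must be a tuple")
    # Value check on decoding_method
    if decoding_method not in VALID_DECODING_METHODS:
        raise ValueError(f"decoding_method must be one of {VALID_DECODING_METHODS}")
```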
Maximum amount of time in seconds that the query should take.
NOTE: this does not include network overhead.
Range: 0-120.0
exponential_decay_length_penalty: Tuple(int, float)
Type should also include ExponentialDecayLengthPenalty
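A sketch of accepting either form and normalizing to the tuple that generate() expects. The field names on the dataclass are assumptions standing in for the real caikit data model class:

```python
from dataclasses import dataclass
from typing import Tuple, Union

@dataclass
class ExponentialDecayLengthPenalty:
    """Hypothetical stand-in for the caikit data model class;
    field names here are assumptions for illustration."""
    start_index: int
    decay_factor: float

def normalize_decay_penalty(
    penalty: Union[Tuple[int, float], ExponentialDecayLengthPenalty]
) -> Tuple[int, float]:
    # Accept either the data model object or a raw (int, float) tuple
    if isinstance(penalty, ExponentialDecayLengthPenalty):
        return (penalty.start_index, penalty.decay_factor)
    return penalty
```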
Supports #155