
Add support for other base LLM models #288

Closed
neubig opened this issue Aug 25, 2023 · 8 comments
Labels: enhancement (New feature or request), good first issue (Good for newcomers)

Comments

@neubig
Collaborator

neubig commented Aug 25, 2023

Currently, prompt2model only supports OpenAI as the base LLM used for distillation, etc. It would be good to allow other models, including open models, as the base LLM.

One way we might be able to do this is by re-using general-purpose inference code from libraries such as Zeno Build or litellm.
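For illustration, here is a minimal sketch of the kind of provider-agnostic call litellm exposes. The model names and API-key environment variables are assumptions for the example, not prompt2model code.

```python
# Minimal sketch: one completion() call routed to different providers via litellm.
# Model names and API keys here are illustrative, not part of prompt2model.
from litellm import completion

messages = [{"role": "user", "content": "Write one sentence about model distillation."}]

# Hosted OpenAI model (expects OPENAI_API_KEY in the environment).
openai_resp = completion(model="gpt-3.5-turbo", messages=messages)

# Hugging Face Inference API model through the same interface
# (expects HUGGINGFACE_API_KEY; the "huggingface/" prefix selects the provider).
hf_resp = completion(model="huggingface/bigscience/bloom", messages=messages)

print(openai_resp.choices[0].message.content)
print(hf_resp.choices[0].message.content)
```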

@neubig added the enhancement and good first issue labels on Aug 25, 2023
@Anindyadeep
Contributor

There are two ways to do this:

  1. We host our base LLM ourselves and access it through an OpenAI-like API interface (sketched below), or
  2. We access our LLMs directly in-process (which would require more memory).

Which of these are we considering here (both?)
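As a concrete illustration of option 1, here is a minimal sketch of querying a self-hosted server that exposes an OpenAI-compatible chat completions endpoint. The URL, model name, and server are placeholders, not anything prompt2model currently ships.

```python
# Sketch of option 1: query a self-hosted, OpenAI-compatible endpoint.
# The URL and model name are placeholders for whatever server hosts the LLM.
import requests

resp = requests.post(
    "http://localhost:8000/v1/chat/completions",  # hypothetical local server
    json={
        "model": "my-local-llm",  # placeholder model identifier
        "messages": [{"role": "user", "content": "Hello!"}],
        "temperature": 0.7,
    },
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```

Option 2 would instead load the weights in-process (e.g. with transformers), trading the simplicity of an API call for local memory and compute.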

@neubig
Collaborator Author

neubig commented Aug 30, 2023

I think the answer is "both", and I have specific suggestions about how we could do so:

  1. For API-based generation, we transition to using litellm, which supports many different providers (including hitting Hugging Face APIs).
  2. We create a meta-function for generating from LLMs that supports generating either from APIs or locally (a rough sketch follows this list), like the one here (see generate_from_huggingface): https://github.com/zeno-ml/zeno-build/blob/d349ff54cb9eb984b8fdb198e8d09041ea9995e4/zeno_build/models/chat_generate.py#L30
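A rough sketch of what such a meta-function could look like. This is not the zeno-build code linked above; the function name, arguments, and defaults are assumptions for illustration.

```python
# Sketch of a meta-function that dispatches between API-based generation
# (via litellm) and local generation (via Hugging Face transformers).
# Names, arguments, and defaults are illustrative.
from litellm import completion
from transformers import pipeline


def generate(prompt: str, model: str, use_api: bool = True, max_new_tokens: int = 128) -> str:
    if use_api:
        # API route: litellm routes the request based on the model string
        # (e.g. "gpt-3.5-turbo" or "huggingface/<repo>").
        response = completion(
            model=model,
            messages=[{"role": "user", "content": prompt}],
        )
        return response.choices[0].message.content
    # Local route: load the model with transformers and generate on-device.
    generator = pipeline("text-generation", model=model)
    output = generator(prompt, max_new_tokens=max_new_tokens, return_full_text=False)
    return output[0]["generated_text"]
```

For example, `generate("Summarize this passage ...", "gpt-3.5-turbo")` would go through the API route, while `generate("Summarize this passage ...", "gpt2", use_api=False)` would run locally.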

@neubig neubig assigned neubig and unassigned neubig Aug 31, 2023
@saum7800 saum7800 self-assigned this Aug 31, 2023
@ishaan-jaff

ishaan-jaff commented Sep 3, 2023

@neubig thanks for mentioning LiteLLM. I'm one of the maintainers of LiteLLM. Happy to make a PR for the integration 😊. I'll make one in the next 48 hours.

@saum7800
Collaborator

saum7800 commented Sep 3, 2023

Hey @ishaan-jaff! Thank you so much for your comment. I started working on this a day ago, and I'm happy to see you would like to contribute. Will you be integrating the LiteLLM part of the solution? If so, I can focus on structuring the generation from Hugging Face models on top of that. Let me know, thanks!

@ishaan-jaff

ishaan-jaff commented Sep 5, 2023

PR here for tracking: #324

thanks @krrishdholakia!

@bilal-aamer

Looks like there's a merged PR for this ticket. Is this issue still open for integrating more recent base LLMs?

@saum7800
Collaborator

saum7800 commented Jan 1, 2024

Hey @bilal-aamer! We merged in LiteLLM to take care of this. Should we close the issue, @neubig?

@neubig
Collaborator Author

neubig commented Jan 1, 2024

Yes, I think this can be closed!

@neubig neubig closed this as completed Jan 1, 2024