
How can we use Llama2 here? #10

Open
shivprasad94 opened this issue Aug 4, 2023 · 3 comments


@shivprasad94 commented Aug 4, 2023

I see from the code repo that we are using OpenAI APIs. How can we make this work for open-source models like Llama 2?
Can someone give me details on this and the steps I need to follow?

@stefanhgm (Contributor) commented Sep 27, 2023

Hello @shivprasad94,

sorry for the late reply and thanks for reaching out!

TabLLM is LLM-agnostic, so you can use whatever LLM you want. For instance, to use another HuggingFace model you could create a new JSON config in TabLLM/t-few/configs (e.g., llama.json) and set the origin_model parameter to the model specifier (e.g., "origin_model": "meta-llama/Llama-2-7b").
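A minimal sketch of what such a llama.json could look like, assuming only the model specifier needs to change (any further required fields should mirror the existing model configs in TabLLM/t-few/configs):

```json
{
    "origin_model": "meta-llama/Llama-2-7b"
}
```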

You can then select this model configuration in the run script few-shot-pretrained-100k.sh by changing the loop in line 18 to: for model in 'llama'.
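As a sketch, the edited loop in few-shot-pretrained-100k.sh would look like this, with 'llama' assumed to match the name of the new JSON config file:

```shell
# Line 18 of few-shot-pretrained-100k.sh: iterate over the model configs
# to run; here only the hypothetical 'llama' config (llama.json) is listed.
for model in 'llama'
do
    echo "model config: ${model}"
done
```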

Let us know if you need any further help!

@RyanJJP commented May 6, 2024

There seems to be something wrong with t-few during fine-tuning, since LLaMA is not an encoder-decoder model.

@stefanhgm (Contributor)
Hello @RyanJJP,

thanks for this additional comment. You are right, t-few might not work with LLaMA. However, other fine-tuning methods for LLaMA (e.g., QLoRA) should provide similar functionality. This would require larger changes to the codebase, but conceptually it should be similar.
