DRAFT-UV-27 - Fine-tune TinyLlama and Qwen2.5-coder models for Magistrala and Prism codebase #29

JeffMboya · 2024-10-14T12:44:07Z

What type of PR is this?

This is a feature

What does this do?

This PR fine tunes TinyLlama:1.1b and Qwen2.5-coder:1.5b for Magistrala and Prism codebase

Which issue(s) does this PR fix/relate to?

Resolves Feature: Fine-tune TinyLlama and Qwen2.5-coder models for Magistrala and Prism codebase #27

Have you included tests for your changes?

No, manually tested

Did you document any new/modified features?

No

Notes

Fine tuned model under test located at: https://huggingface.co/J1997/fine_tuned_qwen_2.5_coder_1.5b

drasko · 2024-10-14T23:12:33Z

We will not change TinyLlama with Qwen unless we prove that fine-tuning is equally simple.

So please go through fine-tuning procedure and compare the approaches.

This tool: https://github.com/yamadashy/repopack can be useful to pack Magistrala and CocosAI repos for fine-tuning of moels, but probably there are other tools - so also existing tools and approaches must be studied in order to find the most suitable one.

Signed-off-by: JeffMboya <[email protected]>

drasko · 2024-11-19T14:12:14Z

Let's keep TinyLlama for now, it is super fast on CPU, which is good for demos.

Replace tinyllama with qwen

d917706

Signed-off-by: JeffMboya <[email protected]>

JeffMboya force-pushed the UV-27 branch from 164ff1c to d917706 Compare November 8, 2024 09:19

drasko closed this Nov 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DRAFT-UV-27 - Fine-tune TinyLlama and Qwen2.5-coder models for Magistrala and Prism codebase #29

DRAFT-UV-27 - Fine-tune TinyLlama and Qwen2.5-coder models for Magistrala and Prism codebase #29

JeffMboya commented Oct 14, 2024 •

edited

Loading

drasko commented Oct 14, 2024

drasko commented Nov 19, 2024

DRAFT-UV-27 - Fine-tune TinyLlama and Qwen2.5-coder models for Magistrala and Prism codebase #29

DRAFT-UV-27 - Fine-tune TinyLlama and Qwen2.5-coder models for Magistrala and Prism codebase #29

Conversation

JeffMboya commented Oct 14, 2024 • edited Loading

What type of PR is this?

What does this do?

Which issue(s) does this PR fix/relate to?

Have you included tests for your changes?

Did you document any new/modified features?

Notes

drasko commented Oct 14, 2024

drasko commented Nov 19, 2024

JeffMboya commented Oct 14, 2024 •

edited

Loading