
Feature: Fine-tune TinyLlama and Qwen2.5-coder models for Magistrala and Prism codebase #27

Open · drasko opened this issue Oct 9, 2024 · 4 comments
Labels: enhancement (New feature or request)

drasko (Contributor) commented Oct 9, 2024

Is your feature request related to a problem? Please describe.

No

Describe the feature you are requesting, as well as the possible use case(s) for it.

Just as LLMs can be fine-tuned on custom datasets, so can SLMs.

We want to fine-tune:

  • TinyLlama
  • Phi-3

And we want to fine-tune them on our custom Magistrala, Prism and Cocos repositories, so that we can enhance their intelligence for code generation for our purposes.

We want to compare:

  • Which is better to fine-tune (better documented, easier, faster, etc.)
  • Which shows better results after fine-tuning

Some references:

An analysis should be done on whether fine-tuning or RAG is the better fit for this purpose: https://medium.com/@bijit211987/when-to-apply-rag-vs-fine-tuning-90a34e7d6d25
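If we do go the fine-tuning route, the first step would be turning the repositories into a training set. Below is a minimal sketch of that data-preparation step; the local clone paths, the Go-only filter, and the instruction template are all assumptions for illustration, not a settled format:

```python
# Minimal data-prep sketch (assumptions: repositories cloned locally at the
# hypothetical paths below; output uses a common instruction-tuning JSONL
# layout with "instruction"/"output" fields).
import json
from pathlib import Path

REPOS = {
    "magistrala": Path("./magistrala"),  # hypothetical local clone
    "prism": Path("./prism"),            # hypothetical local clone
}

samples = []
for name, root in REPOS.items():
    # Assumes the codebases are primarily Go; adjust the glob as needed.
    for src in root.rglob("*.go"):
        code = src.read_text(errors="ignore")
        if len(code) < 200:  # skip trivial files
            continue
        samples.append({
            "instruction": f"Write {name} code for {src.relative_to(root)}",
            "output": code,
        })

with open("finetune_dataset.jsonl", "w") as f:
    for s in samples:
        f.write(json.dumps(s) + "\n")
```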

Indicate the importance of this feature to you.

Must-have

Anything else?

No response

@drasko drasko added the enhancement New feature or request label Oct 9, 2024
@drasko drasko changed the title Feature: Fine-tune Phi-3 and/or TinyLlama models for Magistrala and Prism codebase Feature: Fine-tune Phi-3 and TinyLlama models for Magistrala and Prism codebase Oct 9, 2024
drasko (Contributor, Author) commented Oct 9, 2024

Actually, https://github.com/QwenLM/Qwen2.5-Coder seems more promising, especially since https://ollama.com/library/qwen2.5-coder:1.5b runs fast.

We should examine how this one can be fine-tuned as well.
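As a quick smoke test, something like the following should work, assuming a local Ollama server with the model already pulled (`ollama pull qwen2.5-coder:1.5b`) and the `ollama` Python client installed:

```python
# Quick smoke test against a local Ollama instance.
import ollama

response = ollama.chat(
    model="qwen2.5-coder:1.5b",
    messages=[{"role": "user",
               "content": "Write a Go function that reverses a string."}],
)
print(response["message"]["content"])
```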

drasko (Contributor, Author) commented Oct 9, 2024

This should probably be done via SWIFT, as explained here; a rough sketch of an equivalent LoRA setup follows below.
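For reference, here is the same idea sketched with plain Hugging Face PEFT/LoRA rather than SWIFT itself (the model id, LoRA hyperparameters, and dataset file are assumptions carried over from the earlier comment, not the SWIFT pipeline):

```python
# Generic LoRA fine-tuning sketch with Hugging Face transformers + peft.
# Assumes "finetune_dataset.jsonl" from the data-prep sketch above.
import torch
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer, Trainer,
                          TrainingArguments)

model_name = "Qwen/Qwen2.5-Coder-1.5B"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name,
                                             torch_dtype=torch.bfloat16)

# Attach low-rank adapters; only the adapter weights are trained.
model = get_peft_model(model, LoraConfig(
    r=16, lora_alpha=32, target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM"))

def tokenize(example):
    # Concatenate prompt and completion; labels mirror the inputs for
    # causal-LM loss.
    toks = tokenizer(example["instruction"] + "\n" + example["output"],
                     truncation=True, max_length=1024)
    toks["labels"] = toks["input_ids"].copy()
    return toks

train = (load_dataset("json", data_files="finetune_dataset.jsonl")["train"]
         .map(tokenize, remove_columns=["instruction", "output"]))

Trainer(model=model,
        args=TrainingArguments(output_dir="qwen-coder-lora",
                               num_train_epochs=1,
                               per_device_train_batch_size=1,
                               logging_steps=10),
        train_dataset=train).train()
```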

@drasko drasko assigned JeffMboya and unassigned rodneyosodo Oct 9, 2024
drasko (Contributor, Author) commented Oct 9, 2024

qwen2.5-coder:1.5b should definitely be set as our default model for now; it has been tested and confirmed by @dborovcanin as well.

@JeffMboya and @rodneyosodo, please test performance (a rough timing sketch follows below) and, if it is satisfactory, send a PR to replace TinyLlama.
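One rough way to compare latency before opening that PR, assuming both models are pulled into a local Ollama instance (the prompt is arbitrary and timings will vary with hardware):

```python
# Rough latency comparison of the two candidate default models.
import time
import ollama

PROMPT = "Write a Go HTTP handler that returns JSON."

for model in ("tinyllama", "qwen2.5-coder:1.5b"):
    start = time.time()
    reply = ollama.chat(model=model,
                        messages=[{"role": "user", "content": PROMPT}])
    elapsed = time.time() - start
    print(f"{model}: {elapsed:.1f}s, "
          f"{len(reply['message']['content'])} chars")
```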

@JeffMboya JeffMboya changed the title Feature: Fine-tune Phi-3 and TinyLlama models for Magistrala and Prism codebase Feature: Fine-tune TinyLlama and Qwen2.5-coder models for Magistrala and Prism codebase Oct 14, 2024