finetuning neuralchat-7b using intel(r) extension for transformers and workflow interface #907

kta-intel · 2024-01-08T17:24:45Z

Description: this PR add an example of fine-tuning neuralchat-7b on a medical qa dataset using the experimental workflow interface and intel(r) extension for transformers

Objectives:

demonstrate OpenFL support for fine-tuning LLMs in a federated learning workflow and provide example users may follow
demonstrate OpenFL support for Intel(R) Extension for Transformers by fine-tuning the Intel neuralchat-7b model

Changes:
(+) preprocess_dataset.py: to preprocess the MedQuAD dataset to be ingestible by the model and workflow
(+) Workflow_Interface_NeuralChat.ipynb: tutorial notebook
(+) requirements.txt
(mod) stream_redirect.py: resolution for AttributeError: 'RedirectStdStream' object has no attribute 'flush', caused by Trainer

Signed-off-by: kta-intel <[email protected]>

…l/openfl into kta/neuralchat_finetune

openfl-tutorials/experimental/LLM/neuralchat/Workflow_Interface_NeuralChat.ipynb

Signed-off-by: kta-intel <[email protected]>

… point toward original requirements.txt Signed-off-by: kta-intel <[email protected]>

Signed-off-by: kta-intel <[email protected]>

psfoley

Thank you for the great contribution, @kta-intel! Approved

…d workflow interface (securefederatedai#907) * implementation of neuralchat-7b finetuning using itrex and openfl Signed-off-by: kta-intel <[email protected]> * enabling support for itrex-neuralchat with openfl workflow-interface Signed-off-by: kta-intel <[email protected]> * updated model and description Signed-off-by: kta-intel <[email protected]> * fix colab link and add citation Signed-off-by: kta-intel <[email protected]> * add readme and additional setup and preprocess steps Signed-off-by: kta-intel <[email protected]> * fix preprocess step Signed-off-by: kta-intel <[email protected]> * modify readme, fix preprocess_dataset.py, add setup steps in notebook Signed-off-by: kta-intel <[email protected]> * fix lint issues Signed-off-by: kta-intel <[email protected]> * remove whitespace in preprocess_data.py Signed-off-by: kta-intel <[email protected]> * removed some extra torch.saves that were being used for debugging Signed-off-by: kta-intel <[email protected]> * deleted new requirements.txt files and modified setup instructions to point toward original requirements.txt Signed-off-by: kta-intel <[email protected]> * fix typo in notebook Signed-off-by: kta-intel <[email protected]> * fix typo in notebook Signed-off-by: kta-intel <[email protected]> --------- Signed-off-by: kta-intel <[email protected]> Signed-off-by: nammbash <[email protected]>

…d workflow interface (#907) * implementation of neuralchat-7b finetuning using itrex and openfl Signed-off-by: kta-intel <[email protected]> * enabling support for itrex-neuralchat with openfl workflow-interface Signed-off-by: kta-intel <[email protected]> * updated model and description Signed-off-by: kta-intel <[email protected]> * fix colab link and add citation Signed-off-by: kta-intel <[email protected]> * add readme and additional setup and preprocess steps Signed-off-by: kta-intel <[email protected]> * fix preprocess step Signed-off-by: kta-intel <[email protected]> * modify readme, fix preprocess_dataset.py, add setup steps in notebook Signed-off-by: kta-intel <[email protected]> * fix lint issues Signed-off-by: kta-intel <[email protected]> * remove whitespace in preprocess_data.py Signed-off-by: kta-intel <[email protected]> * removed some extra torch.saves that were being used for debugging Signed-off-by: kta-intel <[email protected]> * deleted new requirements.txt files and modified setup instructions to point toward original requirements.txt Signed-off-by: kta-intel <[email protected]> * fix typo in notebook Signed-off-by: kta-intel <[email protected]> * fix typo in notebook Signed-off-by: kta-intel <[email protected]> --------- Signed-off-by: kta-intel <[email protected]> Signed-off-by: manuelhsantana <[email protected]>

kta-intel and others added 5 commits December 12, 2023 14:59

implementation of neuralchat-7b finetuning using itrex and openfl

8ca4fa0

Signed-off-by: kta-intel <[email protected]>

enabling support for itrex-neuralchat with openfl workflow-interface

9d01868

Signed-off-by: kta-intel <[email protected]>

Merge branch 'securefederatedai:develop' into kta/neuralchat_finetune

8770aaf

updated model and description

851f114

Signed-off-by: kta-intel <[email protected]>

Merge branch 'kta/neuralchat_finetune' of https://github.com/kta-inte…

2f7868a

…l/openfl into kta/neuralchat_finetune

kta-intel marked this pull request as draft January 8, 2024 17:25

psfoley reviewed Jan 8, 2024

View reviewed changes

openfl-tutorials/experimental/LLM/neuralchat/Workflow_Interface_NeuralChat.ipynb Outdated Show resolved Hide resolved

fix colab link and add citation

58b01c4

Signed-off-by: kta-intel <[email protected]>

kta-intel marked this pull request as ready for review January 10, 2024 19:12

kta-intel changed the title ~~[WIP] finetuning neuralchat-7b using intel(r) extension for transformers and workflow interface~~ finetuning neuralchat-7b using intel(r) extension for transformers and workflow interface Jan 10, 2024

kta-intel changed the title ~~finetuning neuralchat-7b using intel(r) extension for transformers and workflow interface~~ [WIP] finetuning neuralchat-7b using intel(r) extension for transformers and workflow interface Jan 10, 2024

kta-intel marked this pull request as draft January 10, 2024 19:13

kta-intel added 3 commits January 16, 2024 15:32

add readme and additional setup and preprocess steps

2813524

Signed-off-by: kta-intel <[email protected]>

fix preprocess step

575e243

Signed-off-by: kta-intel <[email protected]>

modify readme, fix preprocess_dataset.py, add setup steps in notebook

11900e6

Signed-off-by: kta-intel <[email protected]>

kta-intel marked this pull request as ready for review January 19, 2024 20:37

kta-intel added 2 commits January 19, 2024 12:43

fix lint issues

e98ca1d

Signed-off-by: kta-intel <[email protected]>

remove whitespace in preprocess_data.py

3fbc05c

Signed-off-by: kta-intel <[email protected]>

kta-intel changed the title ~~[WIP] finetuning neuralchat-7b using intel(r) extension for transformers and workflow interface~~ finetuning neuralchat-7b using intel(r) extension for transformers and workflow interface Jan 25, 2024

kta-intel and others added 5 commits January 26, 2024 13:50

removed some extra torch.saves that were being used for debugging

9ebee87

Signed-off-by: kta-intel <[email protected]>

Merge branch 'securefederatedai:develop' into kta/neuralchat_finetune

ffe9621

deleted new requirements.txt files and modified setup instructions to…

79a6f56

… point toward original requirements.txt Signed-off-by: kta-intel <[email protected]>

fix typo in notebook

c27dde6

Signed-off-by: kta-intel <[email protected]>

fix typo in notebook

101262e

Signed-off-by: kta-intel <[email protected]>

psfoley approved these changes Feb 23, 2024

View reviewed changes

psfoley merged commit 8e69760 into securefederatedai:develop Feb 23, 2024
23 of 26 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

finetuning neuralchat-7b using intel(r) extension for transformers and workflow interface #907

finetuning neuralchat-7b using intel(r) extension for transformers and workflow interface #907

kta-intel commented Jan 8, 2024

psfoley left a comment

finetuning neuralchat-7b using intel(r) extension for transformers and workflow interface #907

finetuning neuralchat-7b using intel(r) extension for transformers and workflow interface #907

Conversation

kta-intel commented Jan 8, 2024

psfoley left a comment

Choose a reason for hiding this comment