[Model] Add the Infinity-Instruct SFT code #278

CathySama · 2024-11-26T04:03:14Z

No description provided.

aoyulong · 2024-11-27T02:20:30Z

examples/qwen/conf_qwen/train/train_qwen_2.5_1.5b.yaml

Remove the train_ prefix from the file names since they are in the train directory.

aoyulong · 2024-11-27T02:23:17Z

examples/qwen/dist_start.sh

We have updated the launcher to use the unified run.py. Please remove dist_start.sh, dist_stop.sh, env.sh, and run.sh.

These files (dist_start.sh, dist_stop.sh, env.sh, and run.sh) and the tokenizer directories have been removed.

aoyulong · 2024-11-27T02:25:46Z

flagscale/train/train_aquila.py

    total_tokens = loss_mask.sum()
-    loss = torch.cat([torch.sum(losses.view(-1) * loss_mask).view(1), total_tokens.view(1)])
+
+    loss = torch.cat([torch.sum(torch.masked_select(losses.view(-1) , loss_mask==1)).view(1), total_tokens.view(1)])


This loss will also be used in pre-training. If the SFT requires a different one, we may need a better way to distinguish them.

I added a new file, train_aquila_sft.py, to keep the SFT loss separate from the pre-training loss.
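For readers comparing the two reductions in the diff above: they return the same sum whenever every per-token loss is finite, and they differ when a masked-out position holds NaN or inf, because multiplying by 0 does not clear those values. The sketch below is a minimal illustration only. It uses plain Python lists in place of tensors so it runs without PyTorch, and the function names are hypothetical, not taken from the PR.

```python
import math


def masked_sum_by_multiply(losses, loss_mask):
    """Mirrors torch.sum(losses * loss_mask): masked-out terms are multiplied by 0."""
    return sum(l * m for l, m in zip(losses, loss_mask))


def masked_sum_by_select(losses, loss_mask):
    """Mirrors torch.sum(torch.masked_select(losses, loss_mask == 1)):
    masked-out terms are dropped before summing."""
    return sum(l for l, m in zip(losses, loss_mask) if m == 1)


mask = [1.0, 1.0, 0.0, 1.0]

# With finite losses the two reductions agree.
finite_losses = [0.5, 1.5, 2.0, 4.0]
finite_multiply = masked_sum_by_multiply(finite_losses, mask)
finite_select = masked_sum_by_select(finite_losses, mask)

# With a NaN at a masked-out position, NaN * 0 is still NaN,
# so the multiply form is poisoned while the select form is not.
losses_with_nan = [0.5, 1.5, math.nan, 4.0]
multiply_result = masked_sum_by_multiply(losses_with_nan, mask)
select_result = masked_sum_by_select(losses_with_nan, mask)

print(finite_multiply, finite_select)
print(multiply_result, select_result)
```

Note that the select form keeps only positions where the mask equals exactly 1, so a mask carrying fractional weights would be handled differently by the two forms.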

aoyulong

LGTM

aoyulong

Please rename conf_qwen to conf since this folder is already in the parent qwen folder.

Added ckpt_format, ckpt_convert_format, and ckpt_convert_save to /conf/train/qwen_2.5_1.5b.yaml for checkpoint conversion.

CathySama · 2024-11-29T01:57:57Z

I have renamed conf_qwen to conf.

aoyulong

LGTM

CathySama added 4 commits November 26, 2024 11:37

qwen update

ec41313

Update run.sh

ca34eec

Update train_aquila.py

41598b3

Update convert.py

9d2c805

CathySama requested a review from a team as a code owner November 26, 2024 04:03

CathySama changed the title ~~fyc update flagscale~~ add Infinity-Instruct SFT code Nov 26, 2024

aoyulong reviewed Nov 27, 2024

CathySama added 11 commits November 27, 2024 13:43

update train qwen2.5_1.5b.yaml

233f732

Delete examples/qwen/conf_qwen/train directory

efd4491

update. train qwen2.5_1.5b.yaml

020a446

Delete examples/qwen directory

c832d3b

update qwen

23a2e1c

Delete flagscale/train directory

96006cf

add train_aquila_sft.py

9bb416b

Delete examples/qwen/tokenizer_hf directory

8c12364

Delete examples/qwen/tokenizer directory

2961291

Update config.yaml - entrypoint

76a377d

Update config_qwen2.5_1.5b.yaml - entrypoint

ca7d692

aoyulong previously approved these changes Nov 28, 2024

aoyulong reviewed Nov 28, 2024

Delete examples/qwen/conf_qwen directory

975e26e

CathySama dismissed aoyulong’s stale review via 975e26e November 29, 2024 01:54

Rename "conf_qwen" to "conf" and update train

a1edbf6

Added ckpt_format, ckpt_convert_format, and ckpt_convert_save to /conf/train/qwen_2.5_1.5b.yaml for checkpoint conversion.

aoyulong changed the title ~~add Infinity-Instruct SFT code~~ [Model] Add the Infinity-Instruct SFT code Dec 3, 2024

aoyulong approved these changes Dec 3, 2024

aoyulong merged commit 39d1775 into FlagOpen:main Dec 3, 2024
