Dev liuyibo #70

rookielyb · 2022-12-30T06:27:42Z

1. Add the Keyphrase Generation project.
2. Add the Keyphrase Generation launch scripts and a small amount of data under examples/text_generation/.
3. Add the main code, gts_engine/pipelines/qiankunding_generation.py.
4. Add the data-processing code, gts_engine/qiankunding/dataloaders/text_generation/dataloader_kgt5.py.
5. Add the model code, gts_engine/qiankunding/models/text_generation/t5_kg.py.
6. Add a TextGenerateEvaluator class in gts_engine/qiankunding/utils/evaluation.py for evaluating generation.
7. Add T5 tokenization in gts_engine/qiankunding/utils/tokenization.py.
8. Running examples/text_generation/run_train_qiankunding.sh succeeds.
9. Running examples/text_generation/run_inference_qiankunding.sh succeeds.

pskun · 2022-12-30T06:53:42Z

examples/text_generation/run_inference_qiankunding.sh

+python gts_engine/gts_engine_inference.py \
+    --task_dir=$TASK_DIR \
+    --engine_type=qiankunding \
+    --task_type=generation \


Change the task type to keyphrase_generation.

pskun · 2022-12-30T06:54:15Z

examples/text_generation/run_train_qiankunding.sh

+    --engine_type=qiankunding \
+    --train_mode=standard \
+    --task_dir=$TASK_DIR \
+    --task_type=generation \


Change the task type to keyphrase_generation.

pskun · 2022-12-30T06:56:07Z

examples/text_generation/run_train_qiankunding.sh

+    --data_dir=$WORK_DIR/examples/text_generation \
+    --save_path=$TASK_DIR/outputs \
+    --pretrained_model_dir=$PRETRAINED_DIR \
+    --train_batchsize=32 \


Have you measured how much GPU memory a batch size this large takes?

pskun · 2022-12-30T07:01:22Z

gts_engine/qiankunding/dataloaders/text_generation/dataloader_kgt5.py

@@ -0,0 +1,141 @@
+import json


Change the directory to: gts_engine/qiankunding/dataloaders/keyphrase_generation

pskun · 2022-12-30T07:03:08Z

gts_engine/qiankunding/models/text_generation/t5_kg.py

@@ -0,0 +1,181 @@
+from genericpath import exists


Change this to: gts_engine/qiankunding/models/keyphrase_generation

pskun · 2022-12-30T07:06:07Z

gts_engine/qiankunding/models/text_generation/t5_kg.py

+logger = Logger().get_log()
+
+
+class T5KG(BaseModel):


Let's call it KeyphraseGenerationT5; the words aren't long anyway.

pskun · 2022-12-30T07:07:37Z

gts_engine/qiankunding/models/text_generation/t5_kg.py

+        inputs = self.train_inputs(batch)
+        outputs = self.model.generate(
+                    input_ids = inputs['input_ids'],
+                    max_length = 32, 


Was this 32 just picked off the top of your head?
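
One way to ground the value rather than guess it is to look at the target-length distribution in the training data and take a high percentile. The following is only a sketch: `suggest_max_length` is a hypothetical helper that is not part of this PR, and whitespace splitting stands in for the T5 tokenizer, whose token counts are what actually matter.

```python
import math
from typing import Callable, Iterable, List


def suggest_max_length(
    targets: Iterable[str],
    tokenize: Callable[[str], List[str]],
    percentile: float = 0.99,
    extra_tokens: int = 1,
) -> int:
    """Return a generation length that covers `percentile` of the targets.

    `extra_tokens` leaves room for special tokens such as EOS (assumed: 1).
    """
    lengths = sorted(len(tokenize(t)) for t in targets)
    if not lengths:
        raise ValueError("targets must not be empty")
    # Nearest-rank percentile: the smallest length covering the requested share.
    rank = max(1, math.ceil(percentile * len(lengths)))
    return lengths[rank - 1] + extra_tokens


# Hypothetical keyphrase targets; str.split is a stand-in for the real tokenizer.
sample_targets = ["deep learning", "keyphrase generation with t5", "nlp"]
print(suggest_max_length(sample_targets, str.split, percentile=1.0))
```

If the suggested value on the real data lands near 32, the constant is justified; if it is far off, the truncation is either wasting compute or cutting off keyphrases.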

pskun · 2022-12-30T07:09:11Z

gts_engine/qiankunding/models/text_generation/t5_kg.py

+
+        outputs = self.model.generate(
+                    input_ids = inputs['input_ids'],
+                    max_length=32


Make this 32 a class member variable.
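
The suggestion could look roughly like the following. This is a sketch only: the real `BaseModel` interface is not shown in the diff, so the class below does not inherit from it, `_StubGenerator` is a hypothetical stand-in for `self.model`, and the names `max_generate_length` and `generate_ids` are assumptions.

```python
from typing import Dict, List


class _StubGenerator:
    """Hypothetical stand-in for the wrapped T5 model; it only truncates the input."""

    def generate(self, input_ids: List[List[int]], max_length: int) -> List[List[int]]:
        return [ids[:max_length] for ids in input_ids]


class KeyphraseGenerationT5:
    # Single source of truth for the generation length, shared by every
    # step that calls generate(); 32 is the value used in the PR.
    max_generate_length: int = 32

    def __init__(self, model=None) -> None:
        self.model = model if model is not None else _StubGenerator()

    def generate_ids(self, inputs: Dict[str, List[List[int]]]) -> List[List[int]]:
        # Every caller reads the class attribute instead of repeating the literal.
        return self.model.generate(
            input_ids=inputs["input_ids"],
            max_length=self.max_generate_length,
        )


model = KeyphraseGenerationT5()
outputs = model.generate_ids({"input_ids": [list(range(40))]})
print(len(outputs[0]))
```

With the literal in one place, the validation and prediction paths cannot drift apart, and a subclass or config loader can override the value without touching the `generate` calls.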

pskun · 2022-12-30T07:11:53Z

gts_engine/qiankunding/utils/evaluation.py


-            results.append(pred)
+        TP, total_pred, total_true = 0, 0, 0


The computation here is almost the same as in the model's validation_step earlier; why not factor it out into a shared function?
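
A shared helper along these lines could serve both `validation_step` and `TextGenerateEvaluator`. It is a sketch under an assumption: the diff only shows `TP, total_pred, total_true`, so the micro-averaged, exact-match precision/recall/F1 below is a guess at the intended metric, and the function names are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple


def count_matches(pred: Iterable[str], true: Iterable[str]) -> Tuple[int, int, int]:
    """Return (true positives, predicted count, gold count) for one sample."""
    pred_set, true_set = set(pred), set(true)
    return len(pred_set & true_set), len(pred_set), len(true_set)


def keyphrase_prf(preds: List[List[str]], trues: List[List[str]]) -> Dict[str, float]:
    """Micro-averaged precision, recall and F1 over all samples."""
    if len(preds) != len(trues):
        raise ValueError("preds and trues must have the same number of samples")
    TP, total_pred, total_true = 0, 0, 0
    for pred, true in zip(preds, trues):
        tp, n_pred, n_true = count_matches(pred, true)
        TP += tp
        total_pred += n_pred
        total_true += n_true
    # Guard against empty predictions or empty gold sets.
    precision = TP / total_pred if total_pred else 0.0
    recall = TP / total_true if total_true else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


scores = keyphrase_prf([["t5", "nlp"], ["bert"]], [["t5"], ["bert", "gpt"]])
print(scores)
```

The model's validation step and the evaluator would then both call `keyphrase_prf`, so a change to the matching rule only has to be made once.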

pskun · 2022-12-30T07:12:35Z

gts_engine/qiankunding/utils/tokenization.py

+    special_tokens += [f"[choice{i+1}]" for i in range(200)]
+    # special_tokens += [f"{i+1}" for i in range(200)]
+
+    print("pretrained_model_path", pretrained_model_path)


Remove the useless debug output.

liuyibo added 4 commits December 16, 2022 10:39

reduce comments

734506d

Merge main

5a436e5

Merge branch 'main' into dev_liuyibo

66ed810

add Keyphrase Generation using T5

2cb12d1

pskun self-requested a review December 30, 2022 06:36

pskun requested changes Dec 30, 2022
