This repository is for few-shot text generation research. Given a handful of training samples, we explore how to maximize adaptation performance on the causal language modeling (CausalLM) task. Specifically, we use GPT-2 as the backbone model and predict the next 40 tokens given the previous 200 tokens (illustrated by the sketch after the list below). We also take the following practical requirements into consideration:
- Parameter efficiency
- Generalization to new domains
- Stability as the number of shots changes
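
The sketch below illustrates this setup with the Hugging Face `transformers` GPT-2 classes; the checkpoint name and decoding settings are placeholders for illustration, not the exact code in `train.py`/`test.py`.

```python
# Minimal sketch of the task setup: condition GPT-2 on a 200-token prefix
# and greedily predict the next 40 tokens. "gpt2" is a placeholder
# checkpoint; the repository may use a different backbone.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

text = "..."  # a passage from one of the five domain corpora
prefix = tokenizer(text, return_tensors="pt").input_ids[:, :200]  # 200-token context

with torch.no_grad():
    output = model.generate(prefix, max_new_tokens=40, do_sample=False)

print(tokenizer.decode(output[0, prefix.shape[1]:]))  # the 40 predicted tokens
```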
Unzip `data.zip` to the repository's main directory. We provide corpora from 5 different domains: gongwen (official documents), international news, poetry, sports news, and short stories.
Install the dependencies:

```bash
pip install -r requirements.txt
```
To train, run:

```bash
python train.py --shotnum $shotnum --domain $domain --adaption_type $adaption_type
```

The trained model will be saved to the `save/` directory by default.
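
For example, `python train.py --shotnum 16 --domain sports --adaption_type lora` trains a LoRA adapter on 16 sports-news samples (see the argument descriptions below).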
To generate predictions, run:

```bash
python test.py --shotnum $shotnum --domain $domain --adaption_type $adaption_type
```

The prediction file will be saved to the `pred/` directory by default.
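
For example, `python test.py --shotnum 16 --domain sports --adaption_type lora` writes the generated continuations for the sports-news test set to `pred/`.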
Arguments:
- `$shotnum`: number of training examples; possible values: {0, 4, 8, 16, 32, 64, 128};
- `$domain`: target domain of adaptation; one of {'gongwen', 'international', 'peotry', 'sports', 'story'};
- `$adaption_type`: method of adaptation to the target domain; one of 'finetune', 'adapter', 'lora', or 'retrieval':
  - 'finetune': traditional full-parameter adaptation;
  - 'adapter': parameter-efficient tuning by inserting adapter blocks (paper: https://arxiv.org/pdf/1902.00751.pdf);
  - 'lora': parameter-efficient tuning by adding low-rank matrices (paper: https://arxiv.org/pdf/2106.09685.pdf);
  - 'retrieval': feed encodings of retrieved passages to the model as references. Training with this setting adds cross-attention blocks and freezes all other parameters; the result should be a domain-agnostic LM that can consult the given passages (see the sketch below).
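
As a rough illustration, the sketch below shows one way the 'lora' and 'retrieval' settings could be wired up with Hugging Face `transformers` and the `peft` library. The hyperparameters, module names, and helper functions are assumptions for illustration only, not the repository's actual implementation.

```python
# Hypothetical sketch of the 'lora' and 'retrieval' adaptation settings;
# the real train.py may implement them differently.
from peft import LoraConfig, TaskType, get_peft_model
from transformers import GPT2Config, GPT2LMHeadModel


def build_lora_model(backbone: str = "gpt2"):
    """'lora': train low-rank matrices added to GPT-2's attention projections."""
    model = GPT2LMHeadModel.from_pretrained(backbone)
    lora_cfg = LoraConfig(
        task_type=TaskType.CAUSAL_LM,
        r=8,                        # rank of the low-rank update (assumed value)
        lora_alpha=16,
        lora_dropout=0.1,
        target_modules=["c_attn"],  # GPT-2's fused QKV projection
    )
    return get_peft_model(model, lora_cfg)  # peft freezes the base weights


def build_retrieval_model(backbone: str = "gpt2"):
    """'retrieval': add cross-attention over retrieved-passage encodings and
    train only those blocks, keeping the original GPT-2 weights frozen."""
    config = GPT2Config.from_pretrained(backbone, add_cross_attention=True)
    model = GPT2LMHeadModel.from_pretrained(backbone, config=config)
    for name, param in model.named_parameters():
        # only the newly added cross-attention parameters remain trainable
        param.requires_grad = "crossattention" in name or "ln_cross_attn" in name
    return model
```

In the retrieval setting, the encoded passages would be passed to the model as `encoder_hidden_states` during training and generation so that the added cross-attention blocks have references to attend to.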