The Huggingface Trainer can be used with customized model structures. See Huggingface Transformers Trainer as a general PyTorch trainer for more detail.
The code is organized around the huggingface transformers Trainer, so it stays modular, clean, and easy to modify, and users get Trainer's logging utilities and straightforward multi-GPU distributed training for free.
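To make the "general PyTorch trainer" point concrete, below is a minimal sketch of training a plain `nn.Module` with Trainer. Everything here (`ToyRegressor`, `ToyDataset`, the hyperparameters) is hypothetical illustration, not code from this repository; the only contract Trainer imposes is that `forward()` returns its loss.

```python
import torch
from torch import nn
from transformers import Trainer, TrainingArguments

# Hypothetical model: any nn.Module works with Trainer as long as
# forward() returns a dict with a "loss" key (or a tuple with loss first).
class ToyRegressor(nn.Module):
    def __init__(self, in_dim=16):
        super().__init__()
        self.linear = nn.Linear(in_dim, 1)

    def forward(self, x, labels=None):
        preds = self.linear(x).squeeze(-1)
        loss = nn.functional.mse_loss(preds, labels) if labels is not None else None
        return {"loss": loss, "logits": preds}

# Hypothetical dataset: items are dicts whose keys match forward()'s
# argument names, so the default collator can batch them.
class ToyDataset(torch.utils.data.Dataset):
    def __len__(self):
        return 256

    def __getitem__(self, idx):
        x = torch.randn(16)
        return {"x": x, "labels": x.sum()}

args = TrainingArguments(
    output_dir="toy_out",            # hypothetical output path
    per_device_train_batch_size=32,
    num_train_epochs=1,
    logging_steps=10,
    report_to="none",                # skip wandb/tensorboard integrations
)
Trainer(model=ToyRegressor(), args=args, train_dataset=ToyDataset()).train()
```

Trainer will warn that the model is not a `PreTrainedModel`, but training, logging, and distributed launch all work the same way for a plain module.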
The major dependencies are huggingface transformers and torch. Some bert4torch code is imported, but those pieces are short and standalone, so they can easily be copied in if you don't want the extra package.
Why not use bert4torch directly? This repository pursues more standard huggingface transformers integration and cleaner code. That said, bert4torch implements many useful modules and tricks, so it remains a good reference.
The repository is laid out as follows:

- examples: Python scripts
- data: datasets
- pretrained_models: huggingface models
Most datasets are available here or from the sources listed below.
| Dataset | Usage | Download |
|---|---|---|
| People's Daily dataset | named entity recognition | china-people-daily-ner-corpus |
| Baidu relation extraction | relation extraction | BD_Knowledge_Extraction |
| Sentiment | sentiment classification | Sentiment |
| THUCNews | text classification, text generation | THUCNews |
| ATEC | text similarity | ATEC |
| BQ | text similarity | BQ |
| LCQMC | text similarity | LCQMC |
| PAWSX | text similarity | PAWSX |
| STS-B | text similarity | STS-B |
| CSL | text generation | CSL |
| THUCNews_sample | text classification | Bert-Chinese-Text-Classification-Pytorch |
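As a starting point for working with these downloads, here is a hedged sketch of reading the People's Daily NER corpus. It assumes the common CoNLL-style layout (one "character tag" pair per line, blank lines between sentences) and a file name like example.train; both the layout and the path are assumptions about the download, so adjust to whatever your copy actually contains.

```python
# Hedged sketch: assumes CoNLL-style "char tag" lines separated by blank
# lines, which is how china-people-daily-ner-corpus is commonly distributed.
def load_ner_file(path):
    sentences, chars, tags = [], [], []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.rstrip("\n")
            if not line:                 # blank line closes a sentence
                if chars:
                    sentences.append((chars, tags))
                    chars, tags = [], []
                continue
            char, tag = line.split()     # e.g. "京 B-LOC"
            chars.append(char)
            tags.append(tag)
    if chars:                            # file may lack a trailing blank line
        sentences.append((chars, tags))
    return sentences

# Hypothetical path under the data/ directory described above.
train = load_ner_file("data/china-people-daily-ner-corpus/example.train")
print(len(train), train[0])
```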