neuraltextgen

An implementation of a text-generation method using BERT, building on the methodology proposed in the paper "BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model" by Alex Wang and Kyunghyun Cho (https://arxiv.org/pdf/1902.04094.pdf). The method is an iterative procedure with three main steps:

initialization: a batch of sentences is created, each one a list of tokens initialized to '[MASK]'
sampling: at each iteration, one token position in each sentence is selected at random
replacement: at each iteration, the selected tokens are replaced by tokens sampled from the distribution given by the logits that BERT outputs
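
The three steps above can be sketched as a short loop. This is only an illustration and not the code in NeuralTextGenerator.py: `toy_logits`, `sample_from_logits`, `generate`, and the tiny `VOCAB` are hypothetical names, and `toy_logits` is a stand-in for a real BERT forward pass so that the example runs without downloading a model.

```python
import math
import random

MASK = "[MASK]"
# Toy vocabulary (assumption); a real run would use BERT's WordPiece vocabulary.
VOCAB = ["the", "cat", "dog", "sat", "ran", "on", "mat", "."]


def toy_logits(tokens, position):
    """Hypothetical stand-in for BERT: one logit per vocabulary entry.

    A real implementation would feed the masked sentence to BERT and read
    the logits at `position`. Here the scores depend on the position only.
    """
    return [math.sin(position + i) for i in range(len(VOCAB))]


def sample_from_logits(logits, rng, temperature=1.0):
    """Draw a vocabulary index from softmax(logits / temperature)."""
    scaled = [value / temperature for value in logits]
    top = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(value - top) for value in scaled]
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]


def generate(batch_size=2, seq_len=5, n_iters=50, seed=0):
    rng = random.Random(seed)
    # 1. initialization: every sentence starts as a list of [MASK] tokens
    batch = [[MASK] * seq_len for _ in range(batch_size)]
    for _ in range(n_iters):
        for sentence in batch:
            # 2. sampling: pick one position uniformly at random
            pos = rng.randrange(seq_len)
            # 3. replacement: mask that position, then sample a new token
            sentence[pos] = MASK
            idx = sample_from_logits(toy_logits(sentence, pos), rng)
            sentence[pos] = VOCAB[idx]
    return batch


if __name__ == "__main__":
    for sentence in generate():
        print(" ".join(sentence))
```

Swapping `toy_logits` for a function that queries a masked language model leaves the loop itself unchanged, which is why the scoring step is kept separate here.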

Extensions:

a new initialization method that lets the user choose between all "[MASK]" tokens, all random tokens, or a mix of the two, controlled by a single parameter that sets the mask-token probability
a new sampling method based on the attention of the token chosen in the previous iteration
a single framework able to handle different languages
fine-tuning on specific tasks for an Italian model (in progress)
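
The first two extensions can be illustrated with a minimal sketch. The names `init_batch`, `mask_prob`, and `next_position` are hypothetical and do not come from the repository. The attention-based step shows just one plausible reading of the idea: the next position is drawn in proportion to the attention weights of the previously chosen token, which are assumed here to be already available as a plain list of non-negative numbers.

```python
import random

MASK = "[MASK]"


def init_batch(batch_size, seq_len, vocab, mask_prob=1.0, seed=None):
    """Build a batch where each token is [MASK] with probability `mask_prob`
    and a random vocabulary token otherwise.

    mask_prob=1.0 gives all [MASK]; mask_prob=0.0 gives all random tokens.
    """
    if not 0.0 <= mask_prob <= 1.0:
        raise ValueError("mask_prob must be in [0, 1]")
    rng = random.Random(seed)
    return [
        [MASK if rng.random() < mask_prob else rng.choice(vocab)
         for _ in range(seq_len)]
        for _ in range(batch_size)
    ]


def next_position(attention_row, rng):
    """Pick the next position to resample, weighted by attention.

    `attention_row` is assumed to hold the attention weights from the
    previously chosen token to every position in the sentence.
    """
    if sum(attention_row) <= 0:
        # Fall back to uniform sampling if no attention mass is available.
        return rng.randrange(len(attention_row))
    return rng.choices(range(len(attention_row)), weights=attention_row, k=1)[0]


if __name__ == "__main__":
    vocab = ["il", "gatto", "dorme", "sul", "divano"]
    print(init_batch(2, 5, vocab, mask_prob=0.5, seed=0))
    print(next_position([0.1, 0.6, 0.1, 0.1, 0.1], random.Random(0)))
```

With a single `mask_prob` parameter, the original all-mask initialization is simply the default value of 1.0, so existing behaviour is preserved while intermediate mixes become available.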

Repository contents:

RNN
data
evaluation
images
out
paper
texygen @ 3104e22
.gitignore
.gitmodules
Bert_V2.ipynb
Comparison_of_Attention_Methods.ipynb
Example_finetuning.ipynb
Finetuning.ipynb
NeuralTextGenerator.py
README.md
italian_text_generation.ipynb
textprocessing.py
utils.py

JuanJoseMV/neuraltextgen
