This repository contains scripts, notebooks, data, and a report from a project exploring the effects of transformer model distillation on the question answering task. In particular, we fine-tune BERT, RoBERTa, DistilBERT, and DistilRoBERTa on the Stanford Question Answering Dataset (SQuAD) and compare the models' answer quality, training time, and inference time.
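
For illustration, the sketch below shows one way such a fine-tuning run could be set up. It is not the repository's exact script; it assumes the Hugging Face `transformers` and `datasets` libraries and uses the standard extractive-QA recipe of mapping character-level answer spans to token-level start/end positions. Swapping `model_name` for `bert-base-uncased`, `roberta-base`, or `distilroberta-base` would cover the other models compared in the report.

```python
# Minimal sketch (assumes Hugging Face transformers + datasets are installed).
from datasets import load_dataset
from transformers import (AutoTokenizer, AutoModelForQuestionAnswering,
                          TrainingArguments, Trainer, default_data_collator)

model_name = "distilbert-base-uncased"  # or bert-base-uncased, roberta-base, distilroberta-base
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForQuestionAnswering.from_pretrained(model_name)

squad = load_dataset("squad")

def preprocess(examples):
    # Tokenize (question, context) pairs and convert each character-level
    # answer span into token-level start/end positions.
    tokenized = tokenizer(
        examples["question"],
        examples["context"],
        truncation="only_second",
        max_length=384,
        padding="max_length",
        return_offsets_mapping=True,
    )
    start_positions, end_positions = [], []
    for i, offsets in enumerate(tokenized["offset_mapping"]):
        answer = examples["answers"][i]
        start_char = answer["answer_start"][0]
        end_char = start_char + len(answer["text"][0])
        sequence_ids = tokenized.sequence_ids(i)
        # Token span belonging to the context (sequence id 1).
        ctx_start = sequence_ids.index(1)
        ctx_end = len(sequence_ids) - 1 - sequence_ids[::-1].index(1)
        if offsets[ctx_start][0] > start_char or offsets[ctx_end][1] < end_char:
            # Answer was truncated away; label with (0, 0).
            start_positions.append(0)
            end_positions.append(0)
        else:
            idx = ctx_start
            while idx <= ctx_end and offsets[idx][0] <= start_char:
                idx += 1
            start_positions.append(idx - 1)
            idx = ctx_end
            while idx >= ctx_start and offsets[idx][1] >= end_char:
                idx -= 1
            end_positions.append(idx + 1)
    tokenized["start_positions"] = start_positions
    tokenized["end_positions"] = end_positions
    tokenized.pop("offset_mapping")
    return tokenized

train_dataset = squad["train"].map(
    preprocess, batched=True, remove_columns=squad["train"].column_names
)

args = TrainingArguments(
    output_dir="distilbert-squad",
    num_train_epochs=2,
    per_device_train_batch_size=16,
    learning_rate=3e-5,
)
Trainer(
    model=model,
    args=args,
    train_dataset=train_dataset,
    data_collator=default_data_collator,
).train()
```

Comparing training and inference time across the four checkpoints then amounts to timing the same training loop and a batched evaluation pass for each `model_name`.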