Skip to content

Classifier based on my previously annotated speech-act-analysis dataset

Notifications You must be signed in to change notification settings

MelinaPl/speech-act-classifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Speech Act Classifier

Speech act classifier based on speech-act-analysis dataset.

How to train a speech act classifier

Step 1: Download dataset and create dataset splits

File: src/create_splits.py

$ python create_splits.py coarse version_1-1.json
$ python create_splits.py fine version_1-1.json
$ python create_splits.py merged version_1-1.json

Step 2: Train classifier

File: src/train.py

  • Create empty model directory in src/
  • Choose from the following models: "dbmdz/bert-base-german-uncased", "dbmdz/bert-base-german-cased", "deepset/gbert-base", "deepset/gelectra-base"
$ python train.py dbmdz/bert-base-german-uncased  fine

Step 3: Evaluate

File: src/evaluate.py

$ python evaluate.py MODELNAME DATAVERSION PATH_TO_CHECKPOINT

About

Classifier based on my previously annotated speech-act-analysis dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published