Speech act classifier based on the speech-act-analysis dataset.
File: src/create_splits.py
- Create an empty data folder in src/
- Download the following data file: https://github.com/MelinaPl/speech-act-analysis/blob/main/data/version_1-1.json
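The two setup steps above can also be scripted. This is a minimal sketch, not part of the repository. The helper names `to_raw_url` and `prepare` are hypothetical. It assumes that GitHub serves the raw file at raw.githubusercontent.com under the same path without the `/blob/` segment, since the link above points to an HTML page rather than the JSON itself. The README does not say where the JSON file should be saved, so the caller passes the target path.

```python
from pathlib import Path
from urllib.request import urlretrieve

# Link given in this README (points to the GitHub HTML view of the file).
BLOB_URL = "https://github.com/MelinaPl/speech-act-analysis/blob/main/data/version_1-1.json"


def to_raw_url(blob_url: str) -> str:
    """Turn a GitHub 'blob' page URL into the URL of the raw file (assumed URL scheme)."""
    prefix = "https://github.com/"
    if not blob_url.startswith(prefix) or "/blob/" not in blob_url:
        raise ValueError(f"not a GitHub blob URL: {blob_url}")
    owner_repo, path = blob_url[len(prefix):].split("/blob/", 1)
    return f"https://raw.githubusercontent.com/{owner_repo}/{path}"


def prepare(src_dir: str, target: str) -> Path:
    """Create <src_dir>/data and download the dataset to `target` if it is missing."""
    Path(src_dir, "data").mkdir(parents=True, exist_ok=True)
    dest = Path(target)
    if not dest.exists():
        urlretrieve(to_raw_url(BLOB_URL), dest)
    return dest
```

For example, `prepare("src", "src/version_1-1.json")` would create src/data and place the file next to create_splits.py. That location is an assumption based on the commands below, which reference the file without a path.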
$ python create_splits.py coarse version_1-1.json
$ python create_splits.py fine version_1-1.json
$ python create_splits.py merged version_1-1.json
File: src/train.py
- Create an empty model directory in src/
- Choose from the following models: "dbmdz/bert-base-german-uncased", "dbmdz/bert-base-german-cased", "deepset/gbert-base", "deepset/gelectra-base"
$ python train.py dbmdz/bert-base-german-uncased fine
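The command above takes a model name followed by a data version. The sketch below shows one way such positional arguments could be validated before training starts. It is an illustration only, not the actual contents of train.py. The function name `parse_args` is hypothetical, and it assumes train.py accepts the same three data versions (coarse, fine, merged) that create_splits.py does.

```python
import argparse

# Model names listed in this README.
MODELS = [
    "dbmdz/bert-base-german-uncased",
    "dbmdz/bert-base-german-cased",
    "deepset/gbert-base",
    "deepset/gelectra-base",
]
# Assumption: the same versions that create_splits.py accepts.
VERSIONS = ["coarse", "fine", "merged"]


def parse_args(argv=None):
    """Parse 'MODEL VERSION' and reject values outside the known lists."""
    parser = argparse.ArgumentParser(description="Train a speech act classifier.")
    parser.add_argument("model", choices=MODELS, help="pretrained model name")
    parser.add_argument("version", choices=VERSIONS, help="label granularity of the splits")
    return parser.parse_args(argv)
```

With this check, a misspelled model name or data version stops the script with a usage message instead of failing later during model loading.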
File: src/evaluate.py
$ python evaluate.py MODELNAME DATAVERSION PATH_TO_CHECKPOINT