Untangling_LMs

This repository contains one of my personal projects: creating an automatic error checking system using the LLM m-DeBERTa-V3, a brand new model released in 2023 by He et al. (2023)

To run the model you need to:

Download the Jupyter Notebook fine_tune_mDeBERTaV3.ipynb
Download the data from Spraakbanken: https://github.com/spraakbanken/multiged-2023
Obtain the permission from Spraakbanken to obtain the test data
Use the eval.py file to compare your predictions with the truth values

I am not allowed to share the data myself, but simply download it from Spraakbanken's repository and ask for the permission to use the labelled test set.

This project is a work in progress. My goal for the next few weeks is to improve accuracy/precision/recall/F0.5 scores. There are multiple options that I am exploring:

Testing different loss functions (colwise MSE, cross-entropy loss)
Reducing the padding length
Changing the shape of the tensors

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
thesis_data		thesis_data
README.md		README.md
eval.py		eval.py
fine_tune_mDeBERTaV3.ipynb		fine_tune_mDeBERTaV3.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Untangling_LMs

About

Releases

Packages

Languages

CelineNausicaa/Untangling_LMs

Folders and files

Latest commit

History

Repository files navigation

Untangling_LMs

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages