trec-coding

Symbols classification for automatic correction for the coding part of the TREC psychometric test

Streamlit app

To run app in local, use streamlit run treccoding.py Link to app on streamlit community cloud here

To try it, use one of the resultsheets from scan_resultsheets folder.

Better to run community cloud app on Chrome browser. On Firefox, problems with file uploading can happen.

Jupyter notebooks

SymbolsAnnotation and SymbolsCorrection follow the same workflow as the streamlit app, but in a notebook format
SymbolsTraining is used for hyperparameters tuning and training of the neural network
SymbolsDataAugmentation is used to generate new pictures to increase the dataset size, and train again the model

Workflow of the app

For a classic workflow starting from a scan of a test resultsheet, first detect the boxes coordinates and then extract region of interest (ROI), the pixels contained inside the coordinates (the symbols). All these steps are performed on the homepage treccoding.py. Then there are two options :

if you just want to annotate data and save it as .csv, use Annotation page
if you want to correct a resultsheet and obtain a score, use Correction page
- there you can first run automatic correction using the model
- if the automatic correction is not satisfying, you can adjust it manually
- it is always possible to save this as a .csv dataset containing symbols + correct labels

Score is calculated as : number of filled boxes - number of errors

An alternative workflow consists in going directly to the Modele page. There it is possible to load the train dataset or the test dataset. Or upload a .csv file containing a dataset with the same format as the one used to save in the other pages, meaning :

one line per image
each line contain the flatten pixel values (0-255), of an image of shape width = 76 and height = 82
last column of the line is the label of the image (0-9, 0 meaning error : unexisting symbol)

From there all the available models can be evaluated, by running prediction and calculating metrics and confusion table. Best model so far is CNN_10-0.996_20240611_t2.keras

Improvements

CV and interface aspects

Extraction : Display scanned resultsheet in transparency to see the grid better, and grid in black lines
Add the possibility to take a picture of the scanned result sheet
- Need improvement of box detection even with variation in sheet dimension : try SAM or keras CV module
Label management issues :
- Set default label on homepage roi display
- Empty boxes exclusion may cause problems in annotation / correction, due to the callback function used

Deep learning aspects

Encoding of labels
Confusion matrix aesthetics: normalisation on true or pred, possibility to hide diagonal
Implement custom metrics and loss in training fit
Image augmentation for error generation

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
data_resultsheets		data_resultsheets
models		models
pages		pages
scan_resultsheets		scan_resultsheets
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SymbolsAnnotation.ipynb		SymbolsAnnotation.ipynb
SymbolsCorrection.ipynb		SymbolsCorrection.ipynb
SymbolsDataAugmentation.ipynb		SymbolsDataAugmentation.ipynb
SymbolsTraining.ipynb		SymbolsTraining.ipynb
Test_Threshold.py		Test_Threshold.py
evaluate.py		evaluate.py
extract.py		extract.py
img_display.py		img_display.py
requirements.txt		requirements.txt
tcutil.py		tcutil.py
train.py		train.py
treccoding.py		treccoding.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

trec-coding

Streamlit app

Jupyter notebooks

Workflow of the app

Improvements

CV and interface aspects

Deep learning aspects

About

Releases

Packages

Languages

License

csdevignes/trec-coding

Folders and files

Latest commit

History

Repository files navigation

trec-coding

Streamlit app

Jupyter notebooks

Workflow of the app

Improvements

CV and interface aspects

Deep learning aspects

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages