Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What am I doing wrong? #656

Open
johnlockejrr opened this issue Nov 4, 2024 · 2 comments
Open

What am I doing wrong? #656

johnlockejrr opened this issue Nov 4, 2024 · 2 comments

Comments

@johnlockejrr
Copy link

johnlockejrr commented Nov 4, 2024

I'm trying since some time to fine tune a segmentation model for Syriac script with vowels (above and below line), I'm getting closer but not enough. Kraken seems to refuse to comply :)

Ground truth (page-xml):

seg_manual

After fine tuning with:
ketos segtrain -d cuda:0 -f page -t output.txt -q early --min-epochs 100 -cl --threads 10 --resize both --schedule reduceonplateau -i BiblIAlong02_se3_2_tl.mlmodel -o out/syrnt_cl_v1

seg_auto

Any idea? Should I just use bounding boxes around the baselines and not polygons?

@johnlockejrr
Copy link
Author

johnlockejrr commented Nov 4, 2024

Fine tuning on blla way better for text lines but loss in classes...

(ketos segtrain -d cuda:0 -f page -t output.txt -q early --min-epochs 60 --threads 10 --resize both --schedule reduceonplateau -i blla.mlmodel -o out_blla/syrnt_blla_v1)

image

@johnlockejrr
Copy link
Author

johnlockejrr commented Nov 5, 2024

Now is way better... anyway, should I add more padding?

[ketos segtrain -d cuda:0 -f page -t output-syrnt_cl_two.txt -q early --min-epochs 50 --threads 10 --resize both --schedule reduceonplateau -i blla.mlmodel -o out_blla/syrnt_two_blla_v1]

Screenshot 2024-11-05 115910

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant