-
Notifications
You must be signed in to change notification settings - Fork 45
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow User to improve OCR text #887
Comments
Currently there is a funded project at UB Mannheim and SLUB Dresden with the goal of integrating Kitodo with OCR-D. Part of the project is a tool for the DFG-Viewer/Kitodo.Presentation by which users can give feedback on OCR-processed text. I am not sure if this includes a basic workflow for corrections. |
I already had the same idea. For the frontend it might be sufficient to allow editing in the existing text view (that's a simple change) and add a submit button which sends the updated text back to the provider where it can be processed further. At least for smaller changes line matching would still be possible, and the provider could decide how to review and integrate the corrections. If the changes are stored in a local Git repository on the provider side, the presentation could be modified to select among the different revisions. That would also allow updates with new OCR results by the provider. |
Votes: 2 |
Description
Basic idea is, that a User for KITODO.PRESENTATION can improve the OCR text.
To do this, a kind of editing dialog is needed, where the User can edit the text.
To make the further processing more easily, it might be a good idea, to allow editing only line-by-line,
so that it is always known, to which line (ALTO-TextLine) an editing belongs to.
As minimum requirement this editing is somehow stored (e.g. a file referencing document, ALTO-TextLine, old Text, new Text).
This file can be used to merge back to original archive, repository or similar (and change according ALTO-File there) and also to update KITODO.PRESENTATION data from time to time.
As additional requirements
Expected Benefits of this Development
This would bring KITODO.PRESENTATION to a status where it can compete with commercial presentation tools.
Estimated Costs and Complexity
I cannot estimate the effort/cost.
The text was updated successfully, but these errors were encountered: