-
Notifications
You must be signed in to change notification settings - Fork 375
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
win10 chinese filename. TesseractException: Error during processing page. #75
Comments
That's more on the Java side than Tess4J. It's suggested that you check for the file existence before attempting to do OCR on the image file. If Java does not support the current file name, you may have to use a different file naming that Java supports. |
thx |
This is not a Java issue. Having the same problem here using german umlauts (äüö) in paths. The files definitely exist and no other part of the software has a problem with it. Platform: Windows 7 I'll try to provide a testcase for you to reproduce. |
I added unit tests to help you reproduce the error at https://github.com/maherm/tess4j |
@maherm I confirm your findings. An interim workaround I can see is rename the file to an ascii name (utilizing Or use |
@nguyenq Thanks for having a look at this.
That is kind of what I do at the moment: making sure there is never a path passed to tess4j that contains non-ascii symbols. It's a rather ugly workaround, but it does the trick at the moment. I propose reopening this issue until it is fixed. |
I run tess4j on win10 TesseractException: Error during processing error page.
Tesseract.createDocuments (Tesseract.java:565)
I think the reason is Chinese file name, but do not know how to solve
The text was updated successfully, but these errors were encountered: