OOM in cropper #95

Open
bertsky opened this issue May 2, 2022 · 3 comments

Comments

@bertsky
Contributor

bertsky commented May 2, 2022

On a workspace with more than 500 pages, running the cropper fails with:

OSError: [Errno 12] Cannot allocate memory

This happens after VSZ (virtual memory) exceeds 32 GB. In contrast, RSS (resident memory) is still as low as 200 MB.
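To reproduce the VSZ/RSS observation from inside a running Python process, something like the following minimal sketch could be used. It assumes Linux, where `/proc/<pid>/status` reports `VmSize` (VSZ) and `VmRSS` (RSS) in kB. The helper names `parse_vm_status` and `current_memory_kb` are made up for illustration and are not part of the cropper or of pylsd.

```python
import os


def parse_vm_status(status_text):
    """Extract VmSize (VSZ) and VmRSS (RSS), in kB, from /proc/<pid>/status text."""
    sizes = {}
    for line in status_text.splitlines():
        key, _, value = line.partition(":")
        if key in ("VmSize", "VmRSS"):
            # The value looks like "   123456 kB"; keep only the number.
            sizes[key] = int(value.split()[0])
    return sizes


def current_memory_kb(pid="self"):
    """Read VSZ and RSS of a process (Linux only; hypothetical helper)."""
    with open(f"/proc/{pid}/status") as f:
        return parse_vm_status(f.read())


if __name__ == "__main__" and os.path.exists("/proc/self/status"):
    mem = current_memory_kb()
    print(f"VSZ: {mem['VmSize']} kB, RSS: {mem['VmRSS']} kB")
```

Logging these two numbers after each page would show whether VSZ grows steadily per page while RSS stays flat, which is the pattern described above.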

Could this be a leak in the LSD CPython module, @kba?

@kba
Member

kba commented May 2, 2022

> Could this be a leak in the LSD CPython module, @kba?

Totally possible. I did not do any work on pylsd beyond getting it to work as a dependency and publishing to PyPI.

@bertsky
Contributor Author

bertsky commented May 12, 2022

The only workaround at the moment is to process smaller page ranges. But unless you use numerical page IDs, this will be quite difficult with the OCR-D CLI. (The problem being find_files does not support regex search for pageId …)
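As a rough sketch of that workaround, the page IDs can be split into batches and each batch passed to a separate processor call via the OCR-D `-g` (`--page-id`) option as a comma-separated list, so that each call runs in its own process. This assumes the list of page IDs is already known. The processor name, file group names, page IDs and batch size below are placeholders, and `batch_page_ids` and `build_commands` are hypothetical helpers, not part of OCR-D.

```python
import shlex


def batch_page_ids(page_ids, batch_size):
    """Split page IDs into comma-separated strings of at most batch_size IDs."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return [
        ",".join(page_ids[i:i + batch_size])
        for i in range(0, len(page_ids), batch_size)
    ]


def build_commands(processor, input_grp, output_grp, page_ids, batch_size):
    """Build one processor command line (as an argument list) per batch."""
    return [
        [processor, "-I", input_grp, "-O", output_grp, "-g", batch]
        for batch in batch_page_ids(page_ids, batch_size)
    ]


if __name__ == "__main__":
    # Placeholder values for illustration only.
    ids = ["PHYS_0001", "PHYS_0002", "PHYS_0003", "PHYS_0004", "PHYS_0005"]
    for cmd in build_commands("my-cropper", "IN-GRP", "OUT-GRP", ids, 2):
        # Print instead of executing; pass cmd to subprocess.run to execute it.
        print(shlex.join(cmd))
```

Because every batch is a fresh process, any address space accumulated during one batch is released when that process exits.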

@bertsky
Contributor Author

bertsky commented May 12, 2022

> (The problem being find_files does not support regex search for pageId …)

see OCR-D/core#855
