HOCR python binary parser #20

Open
raphidoc opened this issue Dec 7, 2022 · 0 comments
Labels: dependency (abandon hope all), help wanted (Extra attention is needed), time consuming (Time consuming computation)

Comments

raphidoc (Owner) commented Dec 7, 2022

The Kaitai Struct Python binary parser for HOCR takes ~3/4 min on my machine. It would be good to speed up the process and remove the dependency on Python at the same time.

To do that, we could use the cpp_stl version of the Kaitai Struct parser and integrate it with either:

  1. Rcpp
  2. cpp11

Which of these two should we use to integrate the C++ code?
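Whichever binding is picked, the parsing core can probably stay plain standard C++ with only a thin wrapper on top. Below is a minimal sketch of that split. The record layout (a 2-byte id followed by a 4-byte value, both little-endian) and the names `Record`, `read_le` and `parse_records` are hypothetical and are NOT the real HOCR format or the code Kaitai would generate. It only illustrates the kind of function that would end up exported.

```cpp
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical fixed-size record, for illustration only (not the HOCR layout).
struct Record {
    std::uint16_t id;
    std::uint32_t value;
};

// Read an unsigned little-endian integer of n bytes (n <= 4) starting at pos.
static std::uint32_t read_le(const std::vector<std::uint8_t>& buf,
                             std::size_t pos, std::size_t n) {
    if (n > buf.size() || pos > buf.size() - n) {
        throw std::out_of_range("read past end of buffer");
    }
    std::uint32_t v = 0;
    for (std::size_t i = 0; i < n; ++i) {
        v |= static_cast<std::uint32_t>(buf[pos + i]) << (8 * i);
    }
    return v;
}

// Parse back-to-back 6-byte records (u2 id, u4 value) until the buffer ends.
std::vector<Record> parse_records(const std::vector<std::uint8_t>& buf) {
    constexpr std::size_t kRecordSize = 6;
    if (buf.size() % kRecordSize != 0) {
        throw std::invalid_argument("buffer ends with a truncated record");
    }
    std::vector<Record> out;
    out.reserve(buf.size() / kRecordSize);
    for (std::size_t pos = 0; pos < buf.size(); pos += kRecordSize) {
        Record r;
        r.id = static_cast<std::uint16_t>(read_le(buf, pos, 2));
        r.value = read_le(buf, pos + 2, 4);
        out.push_back(r);
    }
    return out;
}
```

With a core like this, the choice between the two options mostly affects the wrapper: Rcpp marks exported functions with a `// [[Rcpp::export]]` comment, while cpp11 uses the `[[cpp11::register]]` attribute. The parsing code itself would not need to change.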

My specs:
description: CPU
product: Intel(R) Core(TM) i7-9750H CPU @ 2.60GHz
vendor: Intel Corp.
physical id: 4
bus info: cpu@0
version: 6.158.10
serial: To Be Filled By O.E.M.
slot: U3E1
size: 4085MHz
capacity: 4500MHz
width: 64 bits
clock: 100MHz
