Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The parser is unable to handle big files #168

Open
Bernardo-MG opened this issue Aug 4, 2015 · 2 comments
Open

The parser is unable to handle big files #168

Bernardo-MG opened this issue Aug 4, 2015 · 2 comments
Labels

Comments

@Bernardo-MG
Copy link
Collaborator

Currently the parser is able to handle only files of a handful MBs.

It has been verified to work with files up to 50MB, and takes around 3 minutes for each 10MBs.

The reason behind this problem seems to be the Pyparsing library. But swapping it for another one would mean rewriting the factory, and the configuration DSL.

@Bernardo-MG Bernardo-MG added the bug label Aug 4, 2015
@Bernardo-MG Bernardo-MG added this to the CWR File Parser milestone Aug 4, 2015
@Alexkane
Copy link

Would it be conceptually possible to break this process down and run it in parallel to improve performance?

@Bernardo-MG
Copy link
Collaborator Author

In my little experience with parallelizing, it isn't so easy to take advantage of.

Also, first of all the parser itself would need to split the file, and then decide how to handle these pieces before continuing with the parsing.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants