memory consumption (exit code 137) #383

Open
rgaudin opened this issue Sep 3, 2024 · 1 comment
Labels
bug Something isn't working question Further information is requested

@rgaudin
Member

rgaudin commented Sep 3, 2024

This is not the first time I've seen this; now that I'm sure there are no moving pieces involved, it's time for a ticket.

This zimit run exhausted its 3.75GB of RAM during warc2zim, i.e. after the browser process had been closed.

warc2zim should not consume this much memory, of course. Binary data should be passed in chunks, and the data it manipulates (text to be rewritten) should not be this large.
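To illustrate what "passed in chunks" means, here is a minimal sketch that copies a binary payload through a fixed-size buffer so peak memory stays bounded by the chunk size rather than the payload size. This is not warc2zim's code: `CHUNK_SIZE`, `iter_chunks` and `copy_in_chunks` are hypothetical names chosen for the example.

```python
import io
from typing import BinaryIO, Iterator

# Hypothetical chunk size for illustration; not a warc2zim setting.
CHUNK_SIZE = 64 * 1024


def iter_chunks(stream: BinaryIO, chunk_size: int = CHUNK_SIZE) -> Iterator[bytes]:
    """Yield successive chunks of at most chunk_size bytes until the stream ends."""
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        yield chunk


def copy_in_chunks(src: BinaryIO, dst: BinaryIO, chunk_size: int = CHUNK_SIZE) -> int:
    """Copy src to dst one chunk at a time; return the number of bytes copied."""
    total = 0
    for chunk in iter_chunks(src, chunk_size):
        dst.write(chunk)
        total += len(chunk)
    return total


if __name__ == "__main__":
    # In-memory streams stand in for a WARC record payload and a ZIM entry.
    source = io.BytesIO(b"\x00" * 1_000_000)
    target = io.BytesIO()
    print(copy_in_chunks(source, target))
```

With this pattern, only one chunk is held at a time regardless of how large the binary is.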

Given this is a scrape of one of our mirrors (🙄), either we are loading binaries into memory, or some content has an incorrect type that leads to it being loaded for rewriting (or both). To be investigated.
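One way to test the second hypothesis would be a guard that refuses to load a payload for rewriting when its declared type is text-like but the bytes look binary or the payload is very large. The sketch below is only an assumption about how such a check could look; `REWRITABLE_TYPES`, `MAX_REWRITE_SIZE`, `looks_binary` and `should_rewrite` are hypothetical names and values, not part of warc2zim.

```python
# Hypothetical set of MIME types considered for rewriting (illustrative only).
REWRITABLE_TYPES = {
    "text/html",
    "text/css",
    "text/javascript",
    "application/javascript",
    "application/json",
}

# Hypothetical upper bound on payloads loaded fully into memory for rewriting.
MAX_REWRITE_SIZE = 50 * 1024 * 1024


def looks_binary(head: bytes) -> bool:
    """Crude heuristic: a NUL byte in the first bytes suggests binary data.

    Note: this would also flag UTF-16 text, so it is only a rough signal.
    """
    return b"\x00" in head


def should_rewrite(content_type: str, head: bytes, size: int) -> bool:
    """Decide whether a payload is worth loading in memory for rewriting."""
    # Drop parameters such as "; charset=utf-8" and normalize case.
    mime = content_type.split(";", 1)[0].strip().lower()
    if mime not in REWRITABLE_TYPES:
        return False
    if size > MAX_REWRITE_SIZE:
        return False
    return not looks_binary(head)


if __name__ == "__main__":
    # An ELF binary served with a text/html header would be skipped.
    print(should_rewrite("text/html", b"\x7fELF\x02\x01\x01\x00", 4096))
```

Logging every payload rejected by such a guard during a run would show whether mislabeled binaries are actually reaching the rewriting path.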

@rgaudin added the bug and question labels Sep 3, 2024
@benoit74 added this to the 2.2.0 milestone Sep 3, 2024
@benoit74 changed the title from "memory consumption" to "memory consumption (exit code 137)" Oct 28, 2024
@benoit74
Collaborator

A "more interesting" case: https://farm.zimit.kiwix.org/pipeline/dec7c862-dafa-4f5f-b0a4-0bc2a3a81200/debug, which is crawling what looks like a more legitimate website, https://crustywindo.ws/
