-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Playwright OS processes remain active after finishing parsing #25
Comments
Hi @gllona . |
This is the
This is
This is the function that does the actual scraping using a custom
Other functions:
Note that the scraper is working file, only that there are playwright processes in the host (outside the docker container) after the scraping finishes. Thanks! |
Hey, @gllona try to use |
I added this feature in that PR: #26 |
Hi!
Today I am running a process parsing ~500 web pages. The parser runs sequentially, no two webpage parsings run concurrently. Parsera is running as a docker container in my local machine with Ubuntu 24.04 on amd64.
I am noticing that after each webpage is parsed, 5 new firefox processes are kept active in my local OS (host level, not inside the docker container. For example:
Each process have this
ps
signature:And the parsera process is running with docker:
My question is: could be possible that Parsera is not closing the playwright process after the parse finishes? Or should I close the playwright instance explicitly? If I should, how can I do it?
Notes: using
parsera==0.1.8
with python 3.12Thank you,
Gorka Llona
The text was updated successfully, but these errors were encountered: