Is it safe to stop and then resume the fetching process with Ctrl+C?
Can that lead to data corruption or inconsistency?
Does `fetch_links.py` properly save data before exiting?
Yes, it is safe to quit and restart, although this is not well tested. I would personally wait until the script outputs "got %s links, wrote %s and %s comments", which happens every 10 links/posts. Waiting for that message avoids quitting while `write_links()` is running.
Since data is written every 10 links/posts, the most data you can lose by quitting is 10 links.
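For context, the worst case is just an unflushed in-memory batch. A minimal sketch of the batching pattern described above; the helper names and file layout here are my assumptions, not `fetch_links.py`'s actual code:

```python
import json

BATCH_SIZE = 10  # matches the "every 10 links/posts" write interval

def write_links(batch, path="links.jsonl"):
    # Stand-in for the script's write step: append one JSON record per link.
    with open(path, "a", encoding="utf-8") as f:
        for link in batch:
            f.write(json.dumps(link) + "\n")

def fetch_all(link_ids):
    batch = []
    for link_id in link_ids:
        batch.append({"id": link_id})  # pretend this is a fetched link
        if len(batch) == BATCH_SIZE:
            write_links(batch)  # data only reaches disk here
            batch.clear()
    if batch:
        write_links(batch)  # flush the final partial batch

fetch_all(range(25))
```

In this shape, killing the process between `write_links()` calls discards at most one in-memory batch, and since the file is only ever appended to at batch boundaries, nothing already written gets corrupted.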
Looking at this in more detail: it is safe to re-run a data fetch, but it's not smart at all. It will re-download all of the data and simply refuse to write it to disk since it's already there. (edit: it will skip fetching comments if the link is already written to file)
So for now, for sanity/efficiency, you have to check the last date you downloaded data for and start your next run on that same date; see the sketch below.
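Until a resume flag exists, something like this could find the restart date. A rough sketch, assuming the output lands in per-date files named `YYYY-MM-DD.*`; the script's real on-disk layout may differ:

```python
import os
import re

def last_fetched_date(data_dir):
    # Scan the output directory for date-prefixed filenames and return
    # the most recent one, or None if nothing has been fetched yet.
    date_re = re.compile(r"\d{4}-\d{2}-\d{2}")
    dates = sorted(
        m.group(0)
        for name in os.listdir(data_dir)
        if (m := date_re.match(name))
    )
    return dates[-1] if dates else None

# Restart on this same date, since its data may be incomplete.
print(last_fetched_date("data"))
```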
I think a 'resume' flag could be added that skips ahead by date based on what's already on disk. Maybe we can improve Ctrl+C handling as well (sketch below). We can leave this one open.
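On the Ctrl+C side, one improvement would be to trap SIGINT and stop only at a batch boundary instead of mid-write. A sketch of that idea, not what the script currently does:

```python
import signal
import time

stop_requested = False

def request_stop(signum, frame):
    # Ctrl+C just sets a flag; the loop below exits at a safe point.
    global stop_requested
    stop_requested = True
    print("Ctrl+C received, finishing the current batch before exiting...")

signal.signal(signal.SIGINT, request_stop)

def fetch_batches():
    # Stand-in for the fetch loop: yields batches of 10 fake links.
    for start in range(0, 100, 10):
        time.sleep(1)  # simulate network time per batch
        yield list(range(start, start + 10))

for batch in fetch_batches():
    # write_links(batch) would run here; once it returns, the batch is safe.
    if stop_requested:
        print("stopped cleanly at a batch boundary")
        break
```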