Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Won't archive old posts' comments #19

Open
SileNce5k opened this issue Jun 17, 2020 · 2 comments
Open

Won't archive old posts' comments #19

SileNce5k opened this issue Jun 17, 2020 · 2 comments

Comments

@SileNce5k
Copy link

SileNce5k commented Jun 17, 2020

When doing fetch_links.py formula1 2014-4-1 2014-4-2 it won't archive any of the comments, just the submissions.

For example:
If you go to this Reddit thread, you will see comments, but it won't be archived.
https://reddit.com/r/formula1/comments/21tvzs/
Archived version

@Marjona6
Copy link

I'm having the opposite problem, where I get all the comments but not the posts themselves. I also get a lot of nearly-blank .csv files in which the entirety of the contents is just this:

author,body,created_utc,id,link_id,parent_id,score,stickied,subreddit_id

@libertysoft3
Copy link
Owner

I think you guys have stumbled across some missing data in pushshift. Maybe they'd appreciate a bug report over there. The formula1 example:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants