Duplicates in scene_list #18

Open
jflasher opened this issue Nov 11, 2016 · 2 comments

Comments

@jflasher

Have seen a couple of duplicates showing up in scene_list.gz. Doesn't seem to be tied to date. Maybe items are getting queued up twice?

$ grep LC80200312015200LGN00 scene_list
LC80200312015200LGN00,2015-07-19 16:16:07.837833,65.39,L1T,20,31,40.62882,-85.17706,42.79844,-82.23444,https://s3-us-west-2.amazonaws.com/landsat-pds/L8/020/031/LC80200312015200LGN00/index.html
LC80200312015200LGN00,2015-07-19 16:16:07.837833,65.39,L1T,20,31,40.62882,-85.17706,42.79844,-82.23444,https://s3-us-west-2.amazonaws.com/landsat-pds/L8/020/031/LC80200312015200LGN00/index.html
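
To gauge how widespread the duplicates are, rather than grepping for one scene at a time, something like the following could count repeated scene IDs across the whole list. This is only a minimal sketch: it assumes scene_list.gz is a gzip-compressed text file with one scene per line and the scene ID in the first comma-separated field, as in the output above. The function names and the default path are hypothetical.

```python
import gzip
from collections import Counter


def find_duplicate_scenes(lines):
    """Return {scene_id: count} for scene IDs that appear more than once.

    Assumes the scene ID is the first comma-separated field of each line.
    Blank lines are ignored.
    """
    counts = Counter(
        line.split(",", 1)[0] for line in lines if line.strip()
    )
    return {scene_id: n for scene_id, n in counts.items() if n > 1}


def find_duplicates_in_file(path="scene_list.gz"):
    # The path is an assumption; point it at a local copy of the file.
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        return find_duplicate_scenes(handle)
```

Calling `find_duplicates_in_file()` on a local copy would list every scene ID that occurs more than once, which might help confirm whether the repeats are tied to the backfill.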
@kapadia
Member

kapadia commented Nov 11, 2016

@jflasher Yes, it's likely scenes are getting queued twice. We're currently backfilling the archive, and though there is some effort toward avoiding duplicates, it's still possible that we ingest the same scene multiple times.

After the back catalog is fully ingested, I'll prune the duplicates from the scene_list. It'll be about 2 months.
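
The thread does not say how the pruning will be done. One possible approach, sketched here under the assumption that duplicates are exact repeated lines (as in the grep output in the report), is to keep the first occurrence of each line and drop the rest. The function names and file paths are placeholders, not anything the project uses.

```python
import gzip


def prune_duplicate_lines(lines):
    """Yield each distinct line once, preserving first-seen order.

    Line endings are stripped before comparison, so the yielded
    strings carry no trailing newline.
    """
    seen = set()
    for line in lines:
        key = line.rstrip("\r\n")
        if key in seen:
            continue
        seen.add(key)
        yield key


def prune_file(src="scene_list.gz", dst="scene_list.deduped.gz"):
    # Both paths are placeholders chosen for illustration.
    with gzip.open(src, "rt", encoding="utf-8") as fin, \
         gzip.open(dst, "wt", encoding="utf-8") as fout:
        for line in prune_duplicate_lines(fin):
            fout.write(line + "\n")
```

Holding every distinct line in a set trades memory for a single pass that preserves the original ordering. If ordering does not matter, sorting the file and dropping adjacent repeats would be an alternative.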

@jflasher
Author

👍 thanks @kapadia.
