Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Audit physical files in production, identify the missing legacy files, and either remove or work with the authors to reupload such. #220

Open
charmoniumQ opened this issue Apr 6, 2023 · 6 comments
Labels
FY25 Sprint 9 FY25 Sprint 9 (2024-10-23 - 2024-11-06) FY25 Sprint 10 FY25 Sprint 10 (2024-11-06 - 2024-11-20) Size: 33 A percentage of a sprint.

Comments

@charmoniumQ
Copy link

What steps does it take to reproduce the issue?

  • Which page(s) does it occurs on?

https://dataverse.harvard.edu/file.xhtml?persistentId=doi:10.7910/DVN/5UMCKK/2FO3JJ&version=2.2

  • What happens?

When I click to download the file, nothing gets downloaded.

Curling the URL directly shows: {"status":"ERROR","code":404,"message":"Datafile 3080259: Failed to locate and/or open physical file."}

  • To whom does it occur (all users, curators, superusers)?

All users

  • What did you expect to happen?

Dataverse would initiate file downloading. At the very list, the browser should show an error message.

Which version of Dataverse are you using?

v. 5.13 build 1244-79d6e57

@charmoniumQ
Copy link
Author

@atrisovic

@landreev
Copy link
Collaborator

I'm going to move this to the "local" github queue specifically for Harvard production issues; since this is a missing file in our production instance, not a bug in the Dataverse application.

@landreev landreev transferred this issue from IQSS/dataverse Apr 18, 2023
@landreev
Copy link
Collaborator

Hello,
This is a genuinely missing file (the record of it exists in the database, but the physical file is missing in storage). This one has been on the radar, or was at some point at least, I found some traces of conversations about it. We believe that the file was never there, it was never saved, never ended up on any backups. Possibly a download that got caught in a server crash. Or a result of some old bug. At some point an attempt was made to contact the author, to see if the file could be re-uploaded, or purged from the database.
The person who tried communicating with them is no longer part of the project. I don't know what happened, but I'm guessing there was no response.

So, it's a messy legacy case. There are a few more like this, unfortunately and we have limited resources for dealing with things of this nature. But thanks for reminding us, I will make another effort to see if it can be reuploaded (or even deleted quietly?)

@sbarbosadataverse sbarbosadataverse added the bug Something isn't working label May 15, 2024
@sbarbosadataverse
Copy link

@landreev Do we have the record of the others? I will make a way to prioritize finding the data owners.

@cmbz cmbz moved this to SPRINT- NEEDS SIZING in IQSS Dataverse Project May 20, 2024
@landreev
Copy link
Collaborator

Yeah, I'll need to look into this again and compile an accurate list of such legacy cases with missing physical files. Not something I can produce instantly.
Danny was still around the last time we tried to address this. I know he was contacting some authors. But not quite sure how many responded etc.

@landreev landreev removed the bug Something isn't working label May 20, 2024
@landreev landreev changed the title Bug: Unable to download file Audit physical files in production, identify the missing legacy files, and either remove or work with the authors to reupload such. Jul 10, 2024
@landreev landreev added the Size: 33 A percentage of a sprint. label Jul 10, 2024
@cmbz cmbz moved this from SPRINT- NEEDS SIZING to SPRINT READY in IQSS Dataverse Project Jul 10, 2024
@stevenwinship stevenwinship self-assigned this Nov 4, 2024
@stevenwinship stevenwinship moved this from SPRINT READY to In Progress 💻 in IQSS Dataverse Project Nov 4, 2024
@cmbz cmbz added FY25 Sprint 9 FY25 Sprint 9 (2024-10-23 - 2024-11-06) FY25 Sprint 10 FY25 Sprint 10 (2024-11-06 - 2024-11-20) labels Nov 5, 2024
@stevenwinship stevenwinship removed their assignment Nov 14, 2024
@stevenwinship
Copy link
Contributor

See PR for new API to get the missing file info: IQSS/dataverse#11016

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
FY25 Sprint 9 FY25 Sprint 9 (2024-10-23 - 2024-11-06) FY25 Sprint 10 FY25 Sprint 10 (2024-11-06 - 2024-11-20) Size: 33 A percentage of a sprint.
Projects
None yet
Development

No branches or pull requests

5 participants