-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Spike: Investigate why University of Virginia's Dataverse installation (Libra Data) can't harvest a set from Harvard Dataverse Repository #206
Comments
I re-added Harvard as a Harvesting Client and it seemed to work (at least the whole thing did not fail). Error with the client I set up years ago just "FAILED"
@jggautier so I think this issue can be closed. Solution was to create a new client on the dashboard. |
Awesome! Glad the harvesting works now. Is it helpful to learn why the old client wasn't working, but the new client is? Is that even possible at this point? Should recreating a failing set be the kind of troubleshooting method we recommend, the kind of troubleshooting where we don't really know why it works (like turning something off and turning it on again)? |
In a meeting this morning (IQSS/dataverse-pm#24) @landreev said that yes, installation admins should consider recreating a failing set when troubleshooting harvesting issues like these. I'll close this issue |
@jggautier leaving a note here that my harvest (with the newly created set - on January 10th - which was a success on that day) failed today, Jan. 19th when I ran it. I had not set up a scheduled harvest. Today was the first time since then that I ran it; I manually ran it today. This was not due to a recent upgrade on UVa's Dataverse repo So how often am I going to have to re-create a new harvest set? Running the client created on Jan 10, immediately failed... there was no "in progress" Running the client I just created (Jan 19) with all the same configurations as before, worked, showed "in progress" until it was completed: |
Sorry to hear, @shlake, and thanks for the update and the question. Ideally you'd have to recreate the client just once! Hopefully hearing that the re-created harvesting client fails when trying to harvest, after the first run was "successful", helps with troubleshooting. Reopening this issue so it's easier to find. |
This is alarming. I do recommend to remove and recreate a client that may have sat around for a long time, when the server on the other end may have been upgraded a few times since the last successful harvest, etc. - as a one off remedy that may fix it. But no, it should not be expected to have to rebuild it from scratch every week! |
(this may or may not need a bug fix issue in the main dev. project - but yes, let's look at the log first) |
@landreev here is a link where I put 3 harvest log files - https://virginia.box.com/s/osi8ujbgcplxxlkehhmkd2jacxh5zu3u
thanks for looking into this. |
Thank you @shlake. |
@landreev just an FYI.... Harvard is still at 5.12 (not 5.12.1), right? |
Harvard is in fact running 5.12.1 in all but name. (it's running a custom, and colorfully-named emergency build that was later released as 5.12.1). |
I produced a (very simple) fix for this. We'll just need to figure out the logistics of applying it to our prod. I will keep you posted (I need to post about this in the Google group too). |
@shlake Could you please check and confirm that you can harvest from us again, now that we are running 5.13 (can harvest from us more than once, that is)? - Thank you! |
@landreev Yes, it worked. The harvest found 1 new record and no errors. 👍 |
OK, thanks, that's great to hear. |
Sherry Lake emailed Dataverse support that the University of Virginia repository she manages, Libra Data at https://dataverse.lib.virginia.edu, can't harvest a set from the Harvard Dataverse Repository. When Sherry tries, using the dataverse_json and the ddi metadata formats, no dataset records are harvested into Libra Data.
Libra Data is using Dataverse v5.11.1. Harvard's repository is using 5.12.
The set from Harvard's repository is called UVA_Authored_Datasets.
In the email thread at https://help.hmdc.harvard.edu/Ticket/Display.html?id=331127, Sherry attached a log with more technical information about the harvesting attempt.
Lastly, when I tried to harvest the set from Harvard's repository into Demo Dataverse, some records are harvested but some failed to be harvested. I'm not sure if it's helpful for this case to know why Demo Dataverse is able to harvest some records but Libra Data can't harvest any.
Definition of done:
The text was updated successfully, but these errors were encountered: