Should irods check validate the stored data or against the md5 file #185
Comments
@ericblanc20 @holtgrewe Input would be much appreciated.
I am not sure I understand what you propose to do. I may be mistaken, but I understand that:
In functional analysis projects, it is often valuable to be able to verify that the local analysis files (on the cluster) are identical to those stored on SODAR, especially when the analysis report has been re-run.
Thanks. The issue is that currently the checksum for any individual file is stored in both individual Given your use-cases at no point should the
This is an interesting point and maybe @mikkonie can chime in on this once he's back from vacation. Why do we actually move the .md5 files into the main iRODS storage? They are only needed for landing zone validation and could be discarded afterwards, as the checksums are also stored in the iRODS metadata. Edit: I guess there is some use in having them readily available for another check after downloading data from SODAR (especially when not using iRODS tools, i.e. Davrods), but this then raises the question of why they're not shown in the "List files" web view.
Currently all `check` commands for irods work against the separately stored `md5` file. This is similar to what is being done by the sodar server commands. After moving a landing zone, there should be no additional need to manually validate these files. These commands duplicate logic already contained in irods, as validation of replica checksums against the stored data is already part of irods itself.
Unless there are `sodar` independent workflows which require manual validation of uploaded `md5` files, I would propose replacing the checks with native `irods` checksum checks in cubi-tk. This affects `irods/check`, `sea-snap/check_irods` and `snappy`.
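To make the two options concrete, here is a minimal sketch of checking one local file against both a sidecar md5 file and a checksum taken from the catalog. It is not cubi-tk code: the function names are hypothetical, the sidecar is assumed to use the `md5sum` layout (`<hex digest>  <file name>`), and the catalog checksum is assumed to be a hex MD5 string fetched elsewhere.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def read_md5_sidecar(path):
    """Return the digest from an md5sum-style sidecar file (assumed layout)."""
    return Path(path).read_text().split()[0].lower()


def compare_checksums(data_path, sidecar_path, catalog_checksum):
    """Compare the local data against the sidecar file and a catalog checksum.

    catalog_checksum is assumed to be a hex MD5 string obtained elsewhere,
    for example from the iRODS catalog; fetching it is out of scope here.
    """
    actual = md5_of_file(data_path)
    return {
        "matches_sidecar": actual == read_md5_sidecar(sidecar_path),
        "matches_catalog": actual == catalog_checksum.lower(),
    }
```

If the two results disagree, the sidecar file and the catalog are out of sync, which is the situation the proposal would avoid by relying on a single source for the checksum.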