fix(ingest): Handle grouping changes by also allowing revocation #2372
I think I need to get my head around the concepts a bit more in discussion then re-review. One other Q. I see |
Ah, sorry, I should really have filed a bug for this, but I have now. I always see this on main when I revoke sequences: #2401
Thanks! Sorry for the delay!
resolves #2299
preview URL: https://ingest-fix-grouping-issue.loculus.org/
Summary
This is for multi-segmented genomes only.
Handle the edge case where metadata fields change on reingest and the previous grouping of segments is no longer valid. Currently we add the new groups and keep the old ones, so sequences may be ingested twice with no warning to users that the metadata has been updated.
This PR adds code that detects this edge case (a grouping change) and can add the new groups and revoke the old ones. Because there is a risk of over-revocation, the `revoke` rule should only be run under manual oversight, so although the code is added here I have not added the `revoke` rule to `rule all`. In a later PR, #2392, I will add code to send us an automated message if such a change is detected, and we can then launch a job that runs this snakemake rule.
This PR also adds very basic unit tests to ensure that revise and revoke work as expected.
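As a rough illustration of the detection step described in this summary, here is a minimal sketch. It is not the code in this PR: the function name `diff_groupings`, the `GroupingDiff` type, and the `LOC_*` and segment identifiers are all hypothetical, and it assumes each group can be represented as a set of segment accessions.

```python
from typing import Dict, FrozenSet, Iterable, List, NamedTuple


class GroupingDiff(NamedTuple):
    """Hypothetical result type; the names are illustrative, not from the PR."""
    unchanged: List[str]             # previous group ids whose segment set still exists
    to_revoke: List[str]             # previous group ids whose segment set is gone
    to_submit: List[FrozenSet[str]]  # new segment sets with no previous counterpart


def diff_groupings(
    previous: Dict[str, FrozenSet[str]],
    current: Iterable[FrozenSet[str]],
) -> GroupingDiff:
    """Compare previously submitted groups with the grouping from a reingest.

    `previous` maps a (hypothetical) submitted group id to its segment set;
    `current` is the collection of segment sets produced by the new grouping.
    """
    current_sets = set(current)
    previous_sets = set(previous.values())

    # A previous group survives only if exactly the same segment set is still grouped.
    unchanged = sorted(gid for gid, segs in previous.items() if segs in current_sets)
    # Otherwise the old group is no longer valid and is a revocation candidate.
    to_revoke = sorted(gid for gid, segs in previous.items() if segs not in current_sets)
    # Segment sets that did not exist before must be submitted as new groups.
    to_submit = sorted((s for s in current_sets if s not in previous_sets), key=sorted)

    return GroupingDiff(unchanged, to_revoke, to_submit)


# Example: a metadata change splits the old group LOC_2 into two single-segment groups.
previous = {
    "LOC_1": frozenset({"seg_A", "seg_B"}),
    "LOC_2": frozenset({"seg_C", "seg_D"}),
}
current = [
    frozenset({"seg_A", "seg_B"}),
    frozenset({"seg_C"}),
    frozenset({"seg_D"}),
]
result = diff_groupings(previous, current)
```

In this sketch `result.to_revoke` only lists candidates. In keeping with the over-revocation concern above, the actual revocation would still be a separate, manually triggered step.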
Screenshot
Ran locally with test data (altering metadata to break grouping and cause revision)
and new sequences are added.
PR Checklist