You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Context of issue:
When we do trait mapping (automated and manual), we use only preferred names, but when we annotate we attempt to use all names. Because we retain previous mappings even if they don't appear (i.e. don't appear among preferred names in current ClinVar), this means obsolete mappings can be not just retained but also used without being updated.
# preferred name yields up-to-date mapping
$ grep -i '^Malignant tumor of urinary bladder' latest_mappings.tsv
malignant tumor of urinary bladder http://purl.obolibrary.org/obo/MONDO_0004986 urinary bladder carcinoma
# alternate name yields obsolete mapping
$ grep -i '^Urinary bladder cancer' latest_mappings.tsv
urinary bladder cancer http://www.ebi.ac.uk/efo/EFO_0000292 bladder carcinoma
In #383 we modified annotated XML generation to use only preferred names, observing that it decreased coverage of traits only slightly while decreasing the number of obsolete EFO terms used significantly.
The goal of this issue is to see what is the impact of making a similar change for OT evidence string generation (which is more complicated due to how it groups and explodes traits), and if it is acceptable make the change.
The text was updated successfully, but these errors were encountered:
Context of issue:
When we do trait mapping (automated and manual), we use only preferred names, but when we annotate we attempt to use all names. Because we retain previous mappings even if they don't appear (i.e. don't appear among preferred names in current ClinVar), this means obsolete mappings can be not just retained but also used without being updated.
Example - in ClinVar:
In latest mappings:
In #383 we modified annotated XML generation to use only preferred names, observing that it decreased coverage of traits only slightly while decreasing the number of obsolete EFO terms used significantly.
The goal of this issue is to see what is the impact of making a similar change for OT evidence string generation (which is more complicated due to how it groups and explodes traits), and if it is acceptable make the change.
The text was updated successfully, but these errors were encountered: