You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Each supplementary concept can have a "mapped to" property that links it to one or more primary concepts. These mappings are usually to the closest match in the list of primary concepts and there are two typical types of non-exactness: a) sometimes specific proteins are mapped to primary concepts representing families of proteins e.g., NASPP1 protein, human (https://meshb.nlm.nih.gov/record/ui?ui=C489391) is mapped to Autoantigens and Nuclear Proteins.
b) the species-specific supplementary concepts are linked to a non-species-specific primary concept. For instnace, the above 3 terms for species-specific MAPK1 are all mapped to https://meshb.nlm.nih.gov/record/ui?ui=D019950.
Some complicated observations made by @steppi a few months ago: The supplementary concepts are explicitly called proteins, e.g., MAPK1 protein, human. The primary concepts aren't explicit about this but there are often clues to them being proteins rather than genes, e.g., A serine/threonine-specific protein kinase which is encoded by the CHEK1 gene in humans. (https://meshb.nlm.nih.gov/record/ui?ui=D000071877). Then there are some complicated cases related one-to-many gene/protein relationships for instance, due to splice variants. For instance for https://meshb.nlm.nih.gov/record/ui?ui=D064546 we have
PKC beta encodes two proteins (PKCB1 and PKCBII) generated by alternative splicing of C-terminal exons.
meaning that this primary concept represents two proteins from the same gene. In another example, we have estrogen receptor alpha 36, human (https://meshb.nlm.nih.gov/record/ui?ui=C000601334) and "estrogen receptor alpha, human" (https://meshb.nlm.nih.gov/record/ui?ui=C506487) as two separate entries that would correspond to separate entries in the uniprot.isoform namespace, though whether the second one can be mapped at all is questionable.
Overall, I'm fairly convinced that both the primary and supplementary concepts should be interpreted as proteins, and that the primary concepts are non-species-specific whereas the supplementary concepts are (explicitly) species specific. Consequently, mappings such as
This is to start a discussion about gene/protein entries in MeSH.
MeSH has supplementary concepts representing human, mouse, and rat-specific proteins. Examples (MAPK1):
https://meshb.nlm.nih.gov/record/ui?ui=C535150
https://meshb.nlm.nih.gov/record/ui?ui=C535148
https://meshb.nlm.nih.gov/record/ui?ui=C535149
Each supplementary concept can have a "mapped to" property that links it to one or more primary concepts. These mappings are usually to the closest match in the list of primary concepts and there are two typical types of non-exactness: a) sometimes specific proteins are mapped to primary concepts representing families of proteins e.g.,
NASPP1 protein, human
(https://meshb.nlm.nih.gov/record/ui?ui=C489391) is mapped to Autoantigens and Nuclear Proteins.b) the species-specific supplementary concepts are linked to a non-species-specific primary concept. For instnace, the above 3 terms for species-specific MAPK1 are all mapped to https://meshb.nlm.nih.gov/record/ui?ui=D019950.
Some complicated observations made by @steppi a few months ago: The supplementary concepts are explicitly called proteins, e.g.,
MAPK1 protein, human
. The primary concepts aren't explicit about this but there are often clues to them being proteins rather than genes, e.g.,A serine/threonine-specific protein kinase which is encoded by the CHEK1 gene in humans.
(https://meshb.nlm.nih.gov/record/ui?ui=D000071877). Then there are some complicated cases related one-to-many gene/protein relationships for instance, due to splice variants. For instance for https://meshb.nlm.nih.gov/record/ui?ui=D064546 we havemeaning that this primary concept represents two proteins from the same gene. In another example, we have
estrogen receptor alpha 36, human
(https://meshb.nlm.nih.gov/record/ui?ui=C000601334) and "estrogen receptor alpha, human" (https://meshb.nlm.nih.gov/record/ui?ui=C506487) as two separate entries that would correspond to separate entries in theuniprot.isoform
namespace, though whether the second one can be mapped at all is questionable.Overall, I'm fairly convinced that both the primary and supplementary concepts should be interpreted as proteins, and that the primary concepts are non-species-specific whereas the supplementary concepts are (explicitly) species specific. Consequently, mappings such as
(https://github.com/biomappings/biomappings/blob/master/src/biomappings/resources/mappings.tsv#L142) ought to be changed.
The text was updated successfully, but these errors were encountered: