Kewalin Samart1,2,4,*, Phoebe Tuyishime1,3,4,*, Arjun Krishnan4,5,#, Janani Ravi1,#
1Pathobiology and Diagnostic Investigation, Michigan State University; 2Mathematics, Michigan State University; 3Food Science and Nutrition, Michigan State University; 4Computational Mathematics, Science and Engineering; 5Biochemistry and Molecular Biology, Michigan State University.
*These authors contributed equally to this work.
#Corresponding authors: [email protected]; [email protected]
The basis of several recent methods for drug repurposing is the key principle that an efficacious drug will reverse the disease molecular 'signature' with minimal side-effects. This principle was defined and popularized by the influential 'connectivity map' study in 2006 regarding reversal relationships between disease- and drug-induced gene expression profiles, quantified by a disease-drug 'connectivity score.' Over the past 15 years, several studies have proposed variations in calculating connectivity scores towards improving accuracy and robustness in light of massive growth in reference drug profiles. However, these variations have been formulated inconsistently using various notations and terminologies even though they are based on a common set of conceptual and statistical ideas. Therefore, we present a systematic reconciliation of multiple disease-drug similarity metrics (ES, css, Sum, Cosine, XSum, XCor, XSpe, XCos, EWCos) and connectivity scores (CS, RGES, NCS, WCS, Tau, CSS, EMUDRA) by defining them using consistent notation and terminology. In addition to providing clarity and deeper insights, this coherent definition of connectivity scores and their relationships provides a unified scheme that newer methods can adopt, enabling the computational drug-development community to compare and investigate different approaches easily. To facilitate the continuous and transparent integration of newer methods, this article will be available as a live document (https://jravilab.github.io/connectivity_scores) coupled with a GitHub repository (https://github.com/jravilab/connectivity_scores) that any researcher can build on and push changes to.
drug repositioning/repurposing | disease gene signature | drug profile | CMap and LINCS L1000 | similarity metrics | connectivity mapping | transcriptomics
- Connectivity mapping is a powerful approach for drug repurposing based on finding drugs that reverse the transcriptional signature of a disease, quantified by a connectivity score.
- Though a number of similarity metrics and connectivity scores have been proposed until now, they have been described using inconsistent notations and terminologies to refer to a common set of concepts and ideas.
- Here, we present a coherent definition of multiple connectivity scores using a unified notation and terminology, along with delineating the clear relationship between these scores.
- Our unified scheme can be adopted easily by newer methods and used for systematic comparisons.
- The live document and GitHub repository will allow continuous incorporation of newer methods.
arXiv | PDF | HTML Live Doc
- Kewalin Samart is a student majoring in Computational Mathematics, and Computational Math, Science & Engineering at Michigan State University. Her research currently focuses on disease and drug expression signatures and host-directed drug repurposing.
- Phoebe Tuyishime is a recent graduate from the College of Agriculture and Natural Resources at Michigan State University. Her research focuses on infectious diseases, computational drug repurposing, and host-directed therapeutics. She will be starting her MS at Univ. British Columbia soon.
- Arjun Krishnan is an Assistant Professor in the Departments of Computational Math, Science & Engineering, and Biochemistry & Molecular Biology at Michigan State University. His group works broadly in the areas of computational genomics and biomedical data science.
- Janani Ravi is an Assistant Professor in Pathobiology and Diagnostic Investigation at Michigan State University. Her research group focuses on protein sequence-structure-function relationships, comparative genomics, and drug repurposing as they relate to infectious diseases.
- Kewalin Samart | @KewalinSamart
- Phoebe Tuyishime | @TuyishimePhoebe
- Arjun Krishnan | @compbiologist | [email protected]
- Janani Ravi | @janani137 | [email protected]