MUVIG: MUlti View on Imaging Genetics

An imaging genetics workflow for finding potential genetic biomarkers of Parkinson's disease (PD) by combining genetic, transcriptomic and imaging data. The workflow of this project is shown in the image above.

Overview

The data analyzed come from the Parkinson’s Progression Marker Initiative (PPMI), one of the most complete and comprehensive collections of PD-related data. PPMI aims to identify new potential biomarkers of PD progression through longitudinal studies that use and correlate data from different sources, in order to support the development of new therapies and treatments. PPMI is sponsored by the Michael J. Fox Foundation for Parkinson’s Research and gathers a large amount of imaging, genetic and neurobehavioral data collected by many research centers in North America, Europe, Israel and Australia. Among the genetic data available in PPMI, genotyping and transcriptomic data provide a comprehensive perspective on the roles that genetic variation, gene expression and gene-gene interactions play in PD.

Data

To test this workflow we used genetic (genotyping and transcriptomic) and imaging (DaTscan and MRI) data available at the PPMI data portal.

Genotyping data consist of a set of DNA sequence polymorphisms (SNPs and indels), available in two datasets: ImmunoChip and NeuroX. Both are in the PLINK binary format.
Transcriptomic data consist of a separate counts file per patient (.txt file).
Imaging data consist of two different sets of images: DaTscan, measuring the amount of dopamine transporter in four regions of the brain striatum (right/left caudate and putamen), and MRI, containing the morphological information of each brain region. Both files are in the csv format. The DaTscan file is available directly from PPMI, whereas the MRI file was produced by processing the images.
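
As an illustration of the genotyping input, the sample information file (.fam) of a PLINK binary dataset is a whitespace-delimited text file with six columns: family ID, sample ID, father ID, mother ID, sex code and phenotype. The following is a minimal Python sketch of reading it with the standard library only. It is not taken from the MUVIG notebooks; the names FamRecord and parse_fam and the sample IDs are hypothetical.

```python
from typing import Iterable, List, NamedTuple


class FamRecord(NamedTuple):
    family_id: str
    sample_id: str
    father_id: str
    mother_id: str
    sex: int        # PLINK coding: 1 = male, 2 = female, 0 = unknown
    phenotype: str  # kept as text: 1 = control, 2 = case, -9 or 0 = missing


def parse_fam(lines: Iterable[str]) -> List[FamRecord]:
    """Parse the lines of a PLINK .fam file into records."""
    records = []
    for number, line in enumerate(lines, start=1):
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if len(fields) != 6:
            raise ValueError(f"line {number}: expected 6 fields, got {len(fields)}")
        fid, iid, father, mother, sex, phenotype = fields
        records.append(FamRecord(fid, iid, father, mother, int(sex), phenotype))
    return records


# Toy input with made-up sample IDs; a real file would be read with open(path).
example_lines = [
    "FAM1 S001 0 0 1 2",
    "FAM2 S002 0 0 2 1",
]
records = parse_fam(example_lines)
cases = [r.sample_id for r in records if r.phenotype == "2"]
print(cases)
```

With a real dataset, the same function can be applied to an open file handle, since iterating over a text file yields its lines.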

Usage

The project was developed in Python inside Jupyter notebooks. The quantile-quantile and Manhattan plots were generated with R, launched from inside the notebook as well. The third part, related to the Functional Interpretation step, was written entirely in R.
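
To make the quantile-quantile plot concrete, the sketch below computes the points such a plot displays: the observed p-values, sorted and transformed to -log10, against the p-values expected under the null hypothesis. This is a Python illustration only; the project itself produces the plot in R. The function name qq_points and the expected-quantile formula (rank - 0.5) / n are assumptions made for this example.

```python
import math
from typing import List, Sequence, Tuple


def qq_points(p_values: Sequence[float]) -> List[Tuple[float, float]]:
    """Return (expected, observed) -log10 p-value pairs for a QQ plot."""
    if any(p <= 0.0 or p > 1.0 for p in p_values):
        raise ValueError("p-values must lie in the interval (0, 1]")
    observed = sorted(p_values)  # smallest p-value first
    n = len(observed)
    points = []
    for rank, p in enumerate(observed, start=1):
        expected = (rank - 0.5) / n  # assumed uniform quantile under the null
        points.append((-math.log10(expected), -math.log10(p)))
    return points


# Toy p-values, not taken from any real association test.
points = qq_points([0.5, 0.01, 0.2, 0.9])
for expected, observed in points:
    print(f"{expected:.3f}\t{observed:.3f}")
```

Points lying well above the diagonal at the high end indicate p-values smaller than expected by chance.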

Installation

To run the complete workflow correctly you need to install the following programs:

PLINK

PLINK is available at PLINK download. ZIP files are available containing binaries compiled for various platforms, as well as the C/C++ source code. Linux/Unix users should download the source code and compile it. The downloads also contain a version of gPLINK, an (optional) GUI for PLINK. PLINK is also available as a Debian package.

The following R packages:

BiocManager::install(c("magrittr","clusterProfiler","Homo.sapiens","AnnotationDbi","EnsDb.Hsapiens.v75","fgsea","BiocParallel"))
install.packages(c("qqman","RNOmni","edgeR","variancePartition","tidyverse","devtools","ggplot2","MKmisc"))
devtools::install_bitbucket("mdonohue/grace")

The following Python modules:

pip install scipy matplotlib-venn pandas matplotlib seaborn

The following tool:

Tates: available at Tates download

Input example

To run the first two steps of the analysis (Individual View and Functional Interpretation), we passed as input two datasets in PLINK binary format, one for each of two different microarray chips. Each dataset in PLINK binary format consists of 3 files:

.bim (PLINK extended MAP file)
.bed (PLINK binary biallelic genotype table)
.fam (PLINK sample information file)
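
Before starting an analysis it can be useful to verify that all three files are present for a given dataset prefix. The following Python sketch does this and also checks the three-byte header that PLINK writes at the start of a variant-major .bed file (0x6c 0x1b 0x01). The function name check_plink_fileset and the prefix toy_chip are hypothetical and are not part of MUVIG.

```python
import tempfile
from pathlib import Path
from typing import List

# First three bytes of a variant-major PLINK .bed file.
BED_MAGIC = bytes([0x6C, 0x1B, 0x01])


def check_plink_fileset(prefix) -> List[str]:
    """Return a list of problems found for the fileset prefix.{bed,bim,fam}."""
    problems = []
    for extension in (".bed", ".bim", ".fam"):
        if not Path(str(prefix) + extension).is_file():
            problems.append(f"missing {extension} file")
    bed_path = Path(str(prefix) + ".bed")
    if bed_path.is_file():
        with bed_path.open("rb") as handle:
            if handle.read(3) != BED_MAGIC:
                problems.append("unexpected .bed header")
    return problems


# Demonstration on a throwaway directory with a deliberately incomplete fileset.
with tempfile.TemporaryDirectory() as tmp:
    prefix = Path(tmp) / "toy_chip"  # hypothetical dataset prefix
    Path(str(prefix) + ".bed").write_bytes(BED_MAGIC + b"\x00")
    Path(str(prefix) + ".bim").write_text("1 rs1 0 1000 A G\n")
    demo_problems = check_plink_fileset(prefix)

print(demo_problems)  # the .fam file was not created above
```

An empty list means the fileset looks complete; the check does not validate the genotype data itself.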

For the third phase (Functional Interpretation), the RNA-Seq analysis, the inputs were several counts files (each simply giving the number of reads overlapping a given gene), which are joined together in the RNASeq.R script.
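
The idea of joining per-sample counts files into a single gene-by-sample table can be sketched as follows. This Python example is only an illustration and does not reproduce RNASeq.R. It assumes each counts file has two tab-separated columns (gene identifier, read count) and no header, which may differ from the actual PPMI files. The function names, gene names and sample names are hypothetical.

```python
import csv
import io
from typing import Dict, List, TextIO, Tuple


def read_counts(handle: TextIO) -> Dict[str, int]:
    """Read one counts file (assumed layout: gene<TAB>count, no header)."""
    counts = {}
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) < 2:
            continue  # skip blank or malformed lines
        counts[row[0]] = int(row[1])
    return counts


def join_counts(per_sample: Dict[str, Dict[str, int]]) -> Tuple[List[str], Dict[str, List[int]]]:
    """Combine per-sample counts into {gene: [count for each sample]}."""
    samples = list(per_sample)
    genes = sorted(set().union(*[counts.keys() for counts in per_sample.values()]))
    # A gene absent from a sample is filled with 0 (a choice made for this sketch).
    matrix = {gene: [per_sample[s].get(gene, 0) for s in samples] for gene in genes}
    return samples, matrix


# Toy in-memory files; real files would be opened with open(path, newline="").
per_sample = {
    "sample_a": read_counts(io.StringIO("GENE1\t10\nGENE2\t0\n")),
    "sample_b": read_counts(io.StringIO("GENE1\t7\nGENE3\t3\n")),
}
samples, matrix = join_counts(per_sample)
print(samples)
for gene, row in matrix.items():
    print(gene, row)
```

Filling missing genes with zero keeps the table rectangular; depending on how the counts were produced, dropping such genes or treating them as missing may be more appropriate.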

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

Support

You can contact the authors directly at their e-mail addresses.

Authors and acknowledgment

Guglielmo Cerri ([email protected]) and Manuel Tognon ([email protected]) created the workflow. Thanks also to Alessia Leanza, who created the workflow image.

License

MIT
