Skip to content

reznik-lab/pancancer-metabolomics

Repository files navigation

pancancer-metabolomics

This repository contains the code to reproduce the findings and figures presented in Benedetti, Liu, Tang, et al., A Multimodal Atlas of Tumor Metabolism Reveals the Architecture of Gene-Metabolite Co-regulation, bioRxiv (2022).

Data

Please source the script 0_DownloadData.R to automatically download the necessary data from Zenodo (DOI: 10.5281/zenodo.7150252).

The generated data structure is as follows:

  • "metabolomics_original" contains the unprocessed metabolomics data
  • "metabolomics_processed" contains the preprocessed metabolomics data
  • "transcriptomics_processed" contains the preprocessed transcriptomics data
  • "TME_deconvolution_processed" contains the immune deconvolution data
  • "flow_sorted_ovarian_metabolomics" contains the ovarian cancer data from Kilgour et al. (2021)
  • "Metabolism_Immune.aDC_exIDO1" contains the dendritic cell signature without IDO1

Code

Data Preprocessing

The preprocessed data can be generated by running the following scripts, in order:

  • 1_PreprocessedMetabo.R
  • 2_ImputeMetabo.R
  • 3_FilterMetabo.R
  • 4_FilterRNA.R

The preprocessed data will be generated and stored in the "results/preprocessed_data/" folder.

Data Analysis

In order to generate the main results presented in the paper, please run the follwing scripts.

Concordance Meta-Analysis

The concordance meta-analysis results can be reproduced by running the following scripts, in order:

  • 5_ConcordanceMetaAnalysis.R
  • 6_ConcordancePathwayEnrichmentAnalysis.R

Please note that script number 5 requires the output of scripts 1-4. Moreover, this script will take a long time to run on a regular laptop with 4 cores (>12h). Script number 6 will take >1h to run on a laptop.

In order to make it easier for users to reproduce our findings without having to run these computationally expensive scripts, we are also providing a pre-computed version of the results generated by these two scripts that will be used to generate the corresponding figures in absence of the output of these scripts. These results are downloaded automatically when running the script 0_DownloadData.R and stored in the data_for_script/ folder.

Tumor Versus Normal Analysis

The tumor vs normal analysis results can be reproduced by running the following script:

  • 7_TumorVsNormalAnalysis.R

Immune Score Analysis

The ImmuneScore and immune cell populations results can be reproduced by running the following script:

  • 8_ImmuneAnalysis.R

This script will take roughly 30 mins to run.

In order to make it easier for users to reproduce our findings without having to run this computationally expensive script, we are also providing a pre-computed version of the results generated by this script that will be used to generate the corresponding figures in absence of the output of this script. These results are downloaded automatically when running the script 0_DownloadData.R and stored in the data_for_script/ folder.

Figures

In order to reproduce the paper figures, please run the following scripts, in order:

  • 9_Figure1-2-4.R This script will generate the panels in Figure 1, 2, 4, as well as Supplementary Figure S1 and S4, and Supplementary Table S1 and S4. This script needs the output of scripts 1-4.
  • 10_Figure3.R This script will generate the panels in Figure 3, as well as Supplementary Table S2 and S3.
  • 11_Figure5-6.R This script will generate the panels in Figure 5 and 6, as well as Supplementary Figure S2 and Supplementary Table S5.