Genome Gerrymandering

Optimal division of the genome into regions with cancer-type-specific differences in mutation rates

Abstract

The activity of mutational processes differs across the genome and is influenced by chromatin state and spatial genome organization. At the scale of one megabase-pair (Mb), regional mutation density correlates strongly with chromatin features and can be used to accurately identify cancer type. Here, we explore the relationship between genomic region and mutation rate by developing an information-theory-driven dynamic programming algorithm that divides the genome into regions with differing relative mutation rates between cancer types. Compared to the naive approach, our algorithm improves mutual information, effectively reducing the average number of mutations required to identify cancer type. This approach provides an efficient method for associating regional mutation density with mutation labels, and has future applications in exploring the role of somatic mutations in a number of diseases.
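The idea in the abstract can be illustrated with a small Python sketch. It is not the implementation in this repository (the optimized version lives in the C sources); it is a minimal illustration under stated assumptions. It assumes the input is a matrix of mutation counts with one row per fine-grained genomic bin and one column per cancer type, and that the goal is to pick `k` contiguous segments maximizing the mutual information between segment index and cancer type. The function names `segment_score`, `mutual_information`, and `k_segment` are hypothetical and chosen for this example only.

```python
import numpy as np


def segment_score(c):
    """Return sum_t c_t * log(c_t / sum(c)), treating 0 * log(0) as 0."""
    total = c.sum()
    if total <= 0:
        return 0.0
    nz = c > 0
    return float((c[nz] * np.log(c[nz] / total)).sum())


def mutual_information(counts, boundaries):
    """Mutual information (in nats) between segment index and cancer type.

    counts: (n_bins, n_types) array of mutation counts (assumed layout).
    boundaries: increasing exclusive end indices; the last equals n_bins.
    """
    n = counts.sum()
    h_type = -segment_score(counts.sum(axis=0)) / n  # H(T)
    cond, start = 0.0, 0
    for end in boundaries:
        cond += segment_score(counts[start:end].sum(axis=0))
        start = end
    return h_type + cond / n  # H(T) - H(T | S)


def k_segment(counts, k):
    """Dynamic program over bin boundaries; requires 1 <= k <= n_bins.

    Because H(T) is fixed, maximizing mutual information is the same as
    maximizing the sum of per-segment scores, which is additive.
    """
    n_bins, n_types = counts.shape
    prefix = np.vstack([np.zeros(n_types), np.cumsum(counts, axis=0)])
    best = np.full((k + 1, n_bins + 1), -np.inf)
    back = np.zeros((k + 1, n_bins + 1), dtype=int)
    best[0, 0] = 0.0
    for s in range(1, k + 1):          # number of segments used so far
        for j in range(s, n_bins + 1):  # exclusive end of the s-th segment
            for i in range(s - 1, j):   # start of the s-th segment
                cand = best[s - 1, i] + segment_score(prefix[j] - prefix[i])
                if cand > best[s, j]:
                    best[s, j] = cand
                    back[s, j] = i
    # Trace the chosen boundaries back from the last bin.
    bounds, j = [], n_bins
    for s in range(k, 0, -1):
        bounds.append(j)
        j = back[s, j]
    return bounds[::-1]


# Toy data (made up for illustration): two cancer types, six bins.
toy = np.array([[10, 0], [10, 0], [0, 10], [0, 10], [0, 10], [0, 10]])
optimal = k_segment(toy, 2)
equal_width = [3, 6]  # equal-width split, used here as a simple baseline
print(optimal, mutual_information(toy, optimal))
print(equal_width, mutual_information(toy, equal_width))
```

This sketch runs in O(k · n_bins²) segment evaluations, which is fine for toy inputs but too slow for whole-genome data in pure Python. The equal-width split is used only as a simple point of comparison and is not necessarily the baseline evaluated in the paper.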

Requirements

Python 3.7

numpy, scipy, matplotlib, seaborn

C

gcc, OpenMP

PSB paper

link: https://psb.stanford.edu/psb-online/proceedings/psb20/Young.pdf

Final paper

The final version of the paper is genome_gerrymandering_thesis_version.pdf, included in this repository.

