WGS pipeline for microbiome profiling from biopsy samples

Computational pipelines for filtering host DNA and evaluating the microbiome abundance from biopsy WGS samples.

Citing our paper:

Chao Zhang, Kyle Cleveland, Felice Schnoll-Sussman, Bridget McClure, Michelle Bigg, Prashant Thakkar, Nikolaus Schultz, Manish A. Shah, and Doron Betel. "Identification of low abundance microbiome in clinical samples using whole genome sequencing." Genome Biology 16 (2015).

Installing prerequisite software and corresponding databases

BWA

Download: BWA 0.6.2
hg19 Human reference genome: http://hgdownload.soe.ucsc.edu/goldenPath/hg19/chromosomes/
Additional human genomes: ftp://ftp.ncbi.nlm.nih.gov/genomes/H_sapiens//ARCHIVE/BUILD.37.3/Assembled_chromosomes/seq/

RepearMasker

Download: RepeatMasker 4.0.5
Database: [Repbase for RepeatMasker] (http://www.girinst.org/)

BLAST & MegaBLAST 2.2.27

Download: ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/2.2.27/
Database: [Combined human sequence database] (http://www.girinst.org/)

Bowtie2

Download: Bowtie2 2.2.5
NCBI bacteria compelete genomes: ftp://ftp.ncbi.nlm.nih.gov/genomes/Bacteria/

PathoScope

Download: PathoScope 2.0

R

Download: R 3.0.2

samtools

Download: samtools 0.1.19

Setting the parameters

Set the correct paths to above software and databases in configs.sh

Filtering host DNA

Step 1. Align all reads to hg19 Human reference genome with BWA
./Filter_S1.sh
Step 2. Align the remaining unmapped reads to additional human genomes with BWA
./Filter_S2.sh
Step 3. Filter the remaining unmapped reads with RepearMasker
./Filter_S3.sh
Step 4. Align the remaining reads to human sequence database with BLAST
./Filter_S4.sh
Step 5. Align the remaining reads to human sequence database with MegaBLAST
./Filter_S5.sh

Microbiome profiling

Step 1. Align fully filtered reads to NCBI bacteria compelete genomes with Bowtie2
Step 2. Calculate the genome coverages to further remove false identification
Step 3. Assign the multiple mapped the reads with PathoScope
Step 4. Re-evaluate the relative abundances of microbiome
./Profiling.sh

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Scripts		Scripts
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WGS pipeline for microbiome profiling from biopsy samples

Citing our paper:

Installing prerequisite software and corresponding databases

BWA

RepearMasker

BLAST & MegaBLAST 2.2.27

Bowtie2

PathoScope

R

samtools

Setting the parameters

Filtering host DNA

Microbiome profiling

About

Releases

Packages

Languages

zhangch/WGSpipeline

Folders and files

Latest commit

History

Repository files navigation

WGS pipeline for microbiome profiling from biopsy samples

Citing our paper:

Installing prerequisite software and corresponding databases

BWA

RepearMasker

BLAST & MegaBLAST 2.2.27

Bowtie2

PathoScope

R

samtools

Setting the parameters

Filtering host DNA

Microbiome profiling

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages