Apples

BCB 546x Final Project paper reproduction

Team: Abby Schaefer, Clayton Carley, Grant Nickles, Jazelli Muetherthies, and Matt Kohane

The goal of this project is to reproduce the analysis in the paper though using the provided 16S data which is available at https://www.ebi.ac.uk/ena/browser/view/PRJEB32455.

Using this Repository:

Familiarize yourself with the original paper.
Install R studio.
Use read_phyloseq.R to read the phyloseq object containing the data (seeds.RDS or apples.RDS) into R.
Use Build_Pyloseq_Obj_Pipeline.rmd to input the raw unzipped FASTQ files from the ENA website and convert them to phyloseq objects with or without phylogenic trees.
Use Clean_out_low_abundance_taxa.rmd to threshold taxa which have above .01% read counts and convert the phyloseq objects to dataframes.
Use each 'figure' folder to observe the pipelines build for recreating each figure and thier outputs.

Information about Taxonomy Assignment:

Download zipped fastq files from the ENA.
Download the Silva taxonomic training data ("silva_nr_v132_train_set.fa.gz")for dada2: https://zenodo.org/record/1172783#.Xefhh-hKi71
Follow the instructions in alignment_dada2.Rmd to create a phyloseq object for downstream analysis.

Name		Name	Last commit message	Last commit date
Latest commit History 78 Commits
FIgure_5		FIgure_5
Figure_1		Figure_1
Figure_3		Figure_3
Figure_4		Figure_4
New Figures		New Figures
PhyloObj		PhyloObj
PhyloObj_WTree		PhyloObj_WTree
reduced_data_frames		reduced_data_frames
.gitignore		.gitignore
Apples.Rproj		Apples.Rproj
Build_Pyloseq_Obj_Pipeline.Rmd		Build_Pyloseq_Obj_Pipeline.Rmd
Clean_out_low_abundance_taxa.Rmd		Clean_out_low_abundance_taxa.Rmd
README.md		README.md
read_phyloseq.R		read_phyloseq.R

Provide feedback