- This repository can be used for the analyses of 16S sequencing data via
DADA2
using asnakemake
workflow
git clone https://github.com/susheelbhanu/16S_DADA2.git
- You might want to adjust some settings in the files
config/sbatch.sh
andconfig/config.yaml
, e.g. the name of thesnakemake
conda
environment, paths and number of cores/threads.
- There is no other option but to create a metadata file outside of this workflow and import
- NOTE: the first column of the metadata file should be named,
"Sample"
- Import file and place in the
config
folder
- Edit
line 15
, i.e. path to the fastq.gz files in thenotes/samples_tsv.sh
script - From the
16S_DADA2
folder run the following:
bash notes/samples_tsv.sh
# install miniconda3
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
chmod u+x Miniconda3-latest-Linux-x86_64.sh
./Miniconda3-latest-Linux-x86_64.sh # follow the instructions
# create venv
conda env create -f requirements.yml -n "YourEnvName"
# in an interactive session
./config/sbatch.sh
# as a slurm job
sbatch ./config/sbatch.sh