- All raw reads in fastq format should be placed in fastq/
- Install dependencies by conda:
conda create -n HA1 -c bioconda -c anaconda -c conda-forge \
python=3.9 \
seqtk \
flash \
biopython \
cutadapt \
snakemake
- Activate conda environment:
source activate HA1
-
Using UMI to correct sequencing errors:
python script/Dedup_UMI.py fastq NNNNNNN 0.8 2
-
Counting mutations:
snakemake -s HA1_pipeline.smk -j 10