Repository for the scRNA-Seq course of the Danish Health Data Science Sandbox project.
Created: November 2021
NOTE: workshop still under construction
This workshop contains a basic tutorial on how to approach scRNAseq experiments, starting from fastq files out of the sequencer. Thus, the workshop does not include any information of laboratory protocols, library preparation or any wet-lab related procedures.
-
Introduction to single cell RNA-Seq
-
Preprocessing of RNAseq reads (fastq)
- Trimming and filtering (TrimGalore)
- Alignment (STARsolo)
- Demultiplexing (STARsolo)
- Feature count (STARsolo)
- QC (FastQC and MultiQC)
-
Read normalization and QC (R: Seurat)
-
Exploratory analysis and clustering (R: Seurat)
-
Differential Expression Analysis (R: Seurat)
-
Pseudotime and trajectories (R: Seurat/Slingshot)
-
Functional Analysis (R: gprofiler2)
- Knowledge of R, Rstudio and Rmarkdown or Python and Jupyter Notebooks.
- Basic knowledge of scRNAseq technology
- Basic knowledge of data science and statistics such as PCA, clustering and statistical testing
The aim of this repository is to run a comprehensive but introductory workshop on sc-RNAseq bioinformatic analyses. Each of the modules of this workshop is accompanied by a powerpoint slideshow explaining the steps and the theory behind a typical bioinformatics analysis (ideally with a teacher). Many of the slides are annotated with extra information and/or point to original sources for extra reading material.
The example Rmarkdown compiles modules 3-7 and can be used as a stand-alone template for a standard scRNA-Seq analysis. The analysis is in R is based on a collection of modified tutorials from the nf-core pipeline, the Seurat, gProfiler2 and slingshot vignettes.
- Samuele Soraggi. Python Data Scientist. University of Aarhus
- Jose Alejandro Romero Herrera. R Data Scientist. University of Copenhagen
- Center for Health Data Science, University of Copenhagen.