This course is intended for trainees in the Genetic Counselling MSc Program at the University of British Columbia, and summer students within the Summer Student Research Program at BC Children's Hospital Research Institute. The computing resources are hosted at BC Children's Hospital Research Institute.
For more information, contact Dr. Phillip Richmond ([email protected]). If you are looking for content related to the manuscript, please see Release 1. This version: November 2022.
The course is designed in a flexible format that can be taught online, in person, or as a combination of both.
Setting up the working environment.
- Content
- Installing necessary software (see below)
- Connecting to the cluster
- Making a workspace
- Slides: Link
- Mac/Linux users only need the native terminal that comes with the operating system (on a Mac, it is in the Applications section). You also need to have X Windows installed: Link
- For PC users, download and install MobaXterm
- IGV installed on local machine
- FileZilla (or other file transfer client) installed on local machine
- BCCHR VPN
- Linux/Unix Cheatsheet
- Linux/Unix online tutorial
- Editor cheat sheets
- File transfer programs
Complete the Prerequisites (above). If you have any challenges, please contact Dr. Phillip Richmond ([email protected]).
Setting up the working environment
- Installing secure shell and sshfs
- Connecting to the cluster
- Making a workspace
- Link
Monday October 31st, 2022 (1:00-3:00) - Intro to course, Intro to Genome Analysis for Rare Disease, Case Distribution
- Soft-start (10 min: 1:00-1:10)
- Course Introduction (Lecture, 75 min: 1:10-2:25)
- Distribute cases & link to resources (Work-along, 15 min - 2:25-2:40)
- Introduce concepts presented in the cases
- Establish working environment (Work-along, 20 minutes - 2:40-3:00)
- Install IGV
- Login to cluster
- Install SSHFS
- Introduction to command line and HPC (Work-along, 45 min - 1:00-1:45)
- Link
- Key commands for navigating
- ls, mkdir, cd, pwd
- Key commands for file handling (& exposure to genomics file formats)
- more, less, grep, head, cut, tail, wc, chmod
- FASTA, FASTQ, VCF
- Introduction to file editing, bash scripts (Work-along, 30 min - 1:45-2:15)
- Nano
- Bash scripts
- Variables
- Problem Set 1 (45 min - 2:15-3:00) - basic Linux navigation and file handling
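The navigation and file-handling commands listed for this session can be tried on toy data before touching real case files. The following is a minimal sketch, not course material: the scratch directory, the file names (`toy.fasta`, `toy.fastq`, `toy.vcf`), and their contents are all made up for illustration.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Make a scratch workspace and move into it (mkdir, cd, pwd).
# The location is hypothetical; on the cluster you would use your own directory.
workspace="$(mktemp -d)"
mkdir -p "$workspace/data"
cd "$workspace/data"
pwd

# Write tiny, made-up example files in three genomics formats.
printf '>seq1 example\nACGTACGT\n>seq2 example\nGGCCTTAA\n' > toy.fasta
printf '@read1\nACGT\n+\nIIII\n@read2\nTTGA\n+\nIIII\n' > toy.fastq
printf '##fileformat=VCFv4.2\n#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\nchr1\t100\t.\tA\tG\t50\tPASS\t.\nchr2\t200\t.\tC\tT\t60\tPASS\t.\n' > toy.vcf

# ls: list what was created.
ls -l

# head: look at the first two lines of the FASTA file.
head -n 2 toy.fasta

# grep -c: count FASTA records by counting header lines that start with ">".
fasta_records="$(grep -c '^>' toy.fasta)"

# wc -l: a FASTQ read occupies 4 lines, so reads = lines / 4.
fastq_lines="$(wc -l < toy.fastq)"
fastq_reads=$(( fastq_lines / 4 ))

# grep -v: drop VCF header lines (they start with "#") and count the rest.
vcf_variants="$(grep -vc '^#' toy.vcf)"

# cut: keep chromosome, position, reference allele, and alternate allele
# (tab-separated columns 1, 2, 4, and 5).
grep -v '^#' toy.vcf | cut -f 1,2,4,5

echo "FASTA records: $fasta_records"
echo "FASTQ reads: $fastq_reads"
echo "VCF variants: $vcf_variants"
```

Saving this as a file and making it executable with `chmod +x` is one way to practise the bash script and variable concepts from the same session. FASTQ reads are counted from the line count rather than with `grep '^@'` because quality strings can also begin with `@`.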
- Problem Set 1 Review (20 min: 8:30-8:50)
- Short read mapping & Visualization (Work-along, 60 min - 8:50-9:50)
- Link
- Pipeline Overview
- Exploring command line tools and how to use them
- Map short reads to the reference genome with BWA-MEM
- Convert, sort, and index with Samtools
- Problem Set 2 (40 min: 9:50-10:30) - short read mapping with bash scripts
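The mapping steps above (map with BWA, then convert, sort, and index with Samtools) might be chained in a bash script along the following lines. This is a sketch under stated assumptions rather than the course's own script: the file names are hypothetical placeholders, the `run` helper and the `DRY_RUN` variable are illustrative additions, and by default the script only prints the commands so it can be read and tested without `bwa` or `samtools` installed.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical placeholder file names, not actual course files.
reference="reference.fa"
reads_1="sample_R1.fastq"
reads_2="sample_R2.fastq"
sample="sample"

# DRY_RUN=1 (the default) only prints each command.
# Set DRY_RUN=0 on a machine where bwa and samtools are available.
DRY_RUN="${DRY_RUN:-1}"
commands_planned=0

# Illustrative helper: print a command, then run it unless in dry-run mode.
run() {
    echo "+ $*"
    commands_planned=$(( commands_planned + 1 ))
    if [ "$DRY_RUN" -eq 0 ]; then
        "$@"
    fi
}

# Index the reference so BWA can search it (needed once per reference).
run bwa index "$reference"

# Map paired-end reads to the reference; output is SAM (text).
run bwa mem -o "$sample.sam" "$reference" "$reads_1" "$reads_2"

# Convert SAM to BAM (compressed binary).
run samtools view -b -o "$sample.bam" "$sample.sam"

# Sort alignments by genomic coordinate.
run samtools sort -o "$sample.sorted.bam" "$sample.bam"

# Index the sorted BAM so viewers such as IGV can jump to a region.
run samtools index "$sample.sorted.bam"
```

Running it as-is prints the five commands in order; running it with `DRY_RUN=0` executes them, assuming the input files exist. The sorted, indexed BAM is the file to load into IGV.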
- Problem Set 2 Review (30 min: 1:00-1:30)
- Interacting with the cluster scheduler (Work-along, 45 min - 1:30-2:15)
- Link
- Concept of job scheduler on cluster
- Basic job scripts
- Mapping and visualization through cluster
- Problem Set 3 (45 min - 2:15-3:00) - Case processing from FASTQ to BAM, then visualization
- Problem Set 3 Review (30 min - 1:00-1:30)
- Call variants with DeepTrio (Work-along, 45 min - 1:30-2:15)
- Link
- Call variants using pre-built DeepTrio scripts on pre-processed small BAM files
- Visualize variants in IGV
- Load GRCh38 genome
- Load read files
- Zoom to region with read coverage
- Take snapshot of variant
- Problem Set 4 (45 min - 2:15-3:00) - Call variants on samples
- Extra time for Problem Set 4 (60 min - 8:30-9:30)
- Open discussion/review of sessions 1-5
- Ensure that everyone has mapped, called variants, and visualized their case data
- Link
- What is Exomiser?
- Run Exomiser using pre-built configs and scripts on Case 4
- Interpreting Exomiser output in Excel/HTML (Work-along, 45 min - 1:45-2:30)
- Download Exomiser output in TSV format, open and explore tabs in Excel
- Visualize output HTML file
- Problem Set 5 (30 min - 2:30-3:00) - Run Exomiser on Case 3
- Review Problem Set 5, address any issues, and make sure everyone has Exomiser output for their case (60 min - 1:00-2:00)
- Open help session, working on Practical (60 min - 2:00-3:00)
- Practical (120 min - 8:30-10:30)
- Each student presents their case to the group: 8 students at ~15 minutes per presentation (10 min presenting, 5 min questions).