Introduction to Bulk RNA-seq data analysis
2nd, 3rd, 4th Oct 2024
In person
Bioinformatics Training Facility, Craik-Marshall Building, Downing Site, University of Cambridge
Instructors
- Ashley Sawle (CRUK Cambridge Institute)
- Abbi Edwards (CRUK Cambridge Institute)
- Betty Wang (Dpt Clinical Neurosciences, University of Cambridge)
- Ian Tsang (NIAB)
- Jiayin Hong (Dpt Biochemistry, University of Cambridge)
- Ramy Ragheb (Stem Cell Institute, University of Cambridge)
Outline
In this workshop, you will learn how to analyse RNA-seq data. The course covers read alignment, quality control, quantification against a reference, reading the count data into R, differential expression analysis, and gene set testing, with a focus on the DESeq2 analysis workflow. You will also learn how to generate common plots for analysing and visualising gene expression data, such as boxplots and heatmaps.
This workshop is aimed at biologists interested in learning how to perform differential expression analysis of RNA-seq data.
Prerequisites
**Some basic experience of using a UNIX/LINUX command line is assumed.**
**Some R knowledge is assumed and essential. Without it, you will struggle on this course.** If you are not familiar with the R statistical programming language, we strongly encourage you to work through an introductory R course before attempting these materials. We recommend our Introduction to R course.
Timetable
Day 1
Trainers: Abbi, Betty, Ian, Ramy, Hugo
9:30 - 9:45 Welcome! - Bajuna
9:45 - 10:15 Introduction to RNAseq Methods - Betty
10:15 - 11:00 Raw read file format and QC - Betty
11:00 - 13:30 Alignment and Quantification of Gene Expression with Salmon - Hugo
13:30 - 14:30 Lunch
14:30 - 15:30 QC of alignment - Abbi
15:30 - 17:30 Data Exploration in R - Ian
Day 2
Trainers: Ash, Abbi (AM), Ian, Ramy
9:30 - 10:15 Introduction to RNAseq Analysis in R - Abbi
10:15 - 11:30 Statistical Analysis of Bulk RNAseq Data - Abbi
11:30 - 17:30 Linear Models in R and DESeq2 (Slides) - Abbi/Ash
- Practical - Differential Expression for RNA-seq (pdf) - Ash/Abbi
- Practical solutions (pdf)
- Live script
- Linear Models in R and DESeq2 (Worksheet) (pdf)
- DESeq2 results extraction cheatsheet
13:00 - 14:00 Lunch
Day 3
Trainers: Ash, Ian, Jiayin, Ramy
9:30 - 9:45 Recap of Days 1 and 2 - Ash
9:45 - 11:00 Annotation of RNA-seq results - Ash
11:00 - 12:30 Visualisation of differential expression results - Jiayin
12:30 - 13:30 Lunch
13:30 - 16:30 Gene-set testing - Ian
Data, software and materials
The lecture slides and other source materials, including R code and practical solutions, can be found in the course’s GitHub repository.
The full data used in the course can be downloaded from Dropbox.
Instructions to install software are available from the “Software installation instructions” page.
Extended materials
The Extended Materials contain extensions to some of the sessions and additional materials, including instruction on downloading and processing the raw data for this course, a link to an excellent R course, and where to get further help after the course.
Acknowledgements
This course is based on the course RNAseq analysis in R prepared by Combine Australia and delivered on 11th and 12th May 2016 in Carlton. We are extremely grateful to the authors for making their materials available: Maria Doyle, Belinda Phipson, Matt Ritchie, Anna Trigos, Harriet Dashnow and Charity Law.
The materials have been rewritten, modified, corrected and updated by various contributors over the past 5 years, including:
Abigail Edwards, Ashley D Sawle, Chandra Chilamakuri, Dominique-Laurent Couturier, Guillermo Parada González, Hugo Tavares, Jon Price, Mark Dunning, Mark Fernandes, Oscar Rueda, Sankari Nagarajan, Stephane Ballereau, Tom Smith, Zeynep Kalender Atak
Apologies if we have missed anyone!