Description.
This course provides an introduction to the tools available through the Bioconductor project for manipulating and analysing high-throughput sequencing (HTS) data. We will present workflows for the analysis of ChIP-Seq and RNA-seq data starting from aligned reads in bam format. We will also describe the various resources available through Bioconductor to annotate and visualize HTS data, which can be applied to any type of sequencing experiment.
Authors.
Prerequisites.
Aims.
Objectives.
Day One.
Day Two.
Day Three.
We recommend using How to Run the course.RStudio for the practicals
Download the materials from this repository and install the required R and Bioconductor packages from within RStudio. This may take several minutes.
source("http://www.bioconductor.org/biocLite.R")
biocLite(c("Biostrings", "ShortRead", "DESeq", "edgeR","biomaRt", "BSgenome",
"BSgenome.Dmelanogaster.UCSC.dm6", "org.Dm.eg.db",
"TxDb.Dmelanogaster.UCSC.dm3.ensGene", "pasillaBamSubset", "pasilla",
"rtracklayer", "ggbio", "vsn","gplots","RColorBrewer","chipseq","htSeqTools","limma","NBPSeq","tweeDEseqCountData","org.Hs.eg.db","Rcade", "exomeCopy","CNAnorm", "ChIPQC","TxDb.Hsapiens.UCSC.hg19.knownGene","BSgenome.Hsapiens.UCSC.hg19", "ChIPpeakAnno","statmod","locfit"))
Using Docker.
docker run -p 8787:8787 markdunning/ngs-in-bioc
Then load your web browser of choice and enter the address
http://localhost:8787
This will allow you to use RStudio in your web browser with the username and password 'rstudio'
Example Data.
Some bam files are required for RNA-seq analysis that are too large to distribute via github. They can be downloaded from the following links and placed in the folder:-
Day2/bam