Workshop Attendees
If you are attending one of our workshops, we will provide a training environment with all of the required software and data. There is no need for you to set anything up in advance.
These instructions are for those who would like setup their own computer to run the analysis demonstrated on the materials.
There are two groups of software tools that we used in the workshop:
We start by giving instructions on how to get a Linux distribution on your computer. However, we note that the command line steps are often computationally heavy and not suitable to be run on a standard laptop. Our recommendation would be to perform those steps on a HPC cluster.
The R analysis are typically fine on a standard laptop.
The recommendation for bioinformatic analysis is to have a dedicated computer running a Linux distribution. The kind of distribution you choose is not critical, but we recommend Ubuntu if you are unsure.
You can follow the installation tutorial on the Ubuntu webpage.
Installing Ubuntu on the computer will remove any other operating system you had previously installed, and can lead to data loss.
The Windows Subsystem for Linux (WSL2) runs a compiled version of Ubuntu natively on Windows.
There are detailed instructions on how to install WSL on the Microsoft documentation page. But briefly:
wsl --install
.
sudo apt install
command as usual.After WSL is installed, it is useful to create shortcuts to your files on Windows. Your C:\
drive is located in /mnt/c/
(equally, other drives will be available based on their letter). For example, your desktop will be located in: /mnt/c/Users/<WINDOWS USERNAME>/Desktop/
. It may be convenient to set shortcuts to commonly-used directories, which you can do using symbolic links, for example:
ln -s /mnt/c/Users/<WINDOWS USERNAME>/Documents/ ~/Documents
ln -s /mnt/c/Users/<WINDOWS USERNAME>/OneDrive/Documents/ ~/Documents
ln -s /mnt/c/Users/<WINDOWS USERNAME>/Desktop/ ~/Desktop
ln -s /mnt/c/Users/<WINDOWS USERNAME>/Downloads/ ~/Downloads
Another way to run Linux within Windows (or macOS) is to install a Virtual Machine. However, this is mostly suitable for practicing and not suitable for real data analysis.
Detailed instructions to install an Ubuntu VM using Oracle’s Virtual Box is available from the Ubuntu documentation page.
Note: In the step configuring “Virtual Hard Disk” make sure to assign a large storage partition (at least 100GB).
After installing Ubuntu (through either of the methods above), open a terminal and run the following commands to update your system and install some essential packages:
sudo apt update && sudo apt upgrade -y && sudo apt autoremove -y
sudo apt install -y git
sudo apt install -y default-jre
We recommend using the Conda package manager to install the command line software. In particular, the newest implementation called Mamba.
To install Mamba, run the following commands from the terminal:
wget "https://github.com/conda-forge/miniforge/releases/latest/download/Mambaforge-$(uname)-$(uname -m).sh"
bash Mambaforge-$(uname)-$(uname -m).sh -b
rm Mambaforge-$(uname)-$(uname -m).sh
Restart your terminal (or open a new one) and confirm that your shell now starts with the word (base)
. Then run the following commands:
conda config --add channels defaults; conda config --add channels bioconda; conda config --add channels conda-forge
conda config --set remote_read_timeout_secs 1000
The command line software used in data pre-processing can be installed using the mamba
package manager.
mamba create --name rnaseq
mamba install --name rnaseq picard salmon fastqc
The R packages used in this course can be installed with the following commands from an R console:
if (!require("BiocManager", quietly = TRUE)){
install.packages("BiocManager")
}
BiocManager::install(c("AnnotationDbi",
"AnnotationHub",
"ComplexHeatmap",
"DESeq2",
"DT",
"GenomicAlignments",
"GenomicFeatures",
"GenomicRanges",
"Glimma",
"biomaRt",
"circlize",
"clusterProfiler",
"corrplot",
"enrichplot",
"ensembldb",
"gganimate",
"ggbio",
"ggdendro",
"ggfortify",
"ggrepel",
"ggvenn",
"msigdbr",
"org.Mm.eg.db",
"patchwork",
"pathview",
"rtracklayer",
"shape",
"tidyverse",
"tximport"))