6th January 2026
Image source: https://web.genewiz.com/single-cell-faq
Aim of QC is …
Above is achieved by …
Seurat (v5) Hao, Y., Stuart, T., Kowalski, M.H. et al.
Take care of version differences!
Orchestrating Single-Cell Analysis with Bioconductor Robert Amezquita, Aaron Lun, Stephanie Hicks, Raphael Gottardo
http://bioconductor.org/books/release/OSCA/
Single-Cell Best Practices Heumos, L., Schaar, A.C., Lance, C. et al.
CellRanger outputs: gives two output folders raw and filtered
Each folder has three zipped files
assays:
This slot stores the raw and processed data in different forms. It is a list of Assay objects, each representing a specific type of data.
Examples: RNA: The most commonly used assay, containing the raw and processed RNA counts. SCT: Stores data processed using SCTransform (a normalization method). integrated: Contains integrated data when datasets have been merged. Each assay can contain matrices like counts, data, and scale.data.
Unlike the bioconductor SingleCellExperiment Object there is not a specific place to store feature data (gene annotations etc) in the Seurat object. This information is typically stored within the assay itself or in the metadata associated with the object as a whole.
meta.data:
A data.frame containing metadata associated with each cell. This can include cell type annotations, experimental conditions, or other variables related to the cells.
Example columns: cell_type, batch, condition, cluster.
reductions:
A list of dimensionality reductions applied to the data. These are used for visualizations like PCA, t-SNE, or UMAP.
Examples: pca, tsne, umap. Each reduction stores a dimensionality reduction object, which contains information about the reduced coordinates (e.g., UMAP coordinates) and additional metadata like the variance explained by principal components.
graphs: A list of graphs (usually a nearest-neighbor graph) that are used for clustering and other analyses. The graph typically represents relationships between cells based on gene expression similarity.
Examples: RNA_snn (a shared nearest-neighbor graph for RNA-seq data), pca_snn.
clusters: This stores the cluster assignments for each cell after a clustering analysis (e.g., Louvain or Leiden clustering). It is typically stored in the meta.data slot but can also be stored in a separate slot.
commands: A record of the commands used to generate the object. This can help in reproducibility by storing a log of the methods and operations that were applied to create or manipulate the Seurat object.
misc: This slot is used to store arbitrary information that doesn’t fit into the other slots. It can be used to store additional analysis results or custom annotations.