Tutorials

Tutorials#

The easiest way to get familiar with scvi-tools is to follow along with our tutorials. Many are also designed to work seamlessly in Google Colab, a free cloud computing platform. Tutorials by default work with the latest installable version of scvi-tools. To view older tutorials, change the documentation version using the tab at the bottom of the left sidebar.

Note

For questions about using scvi-tools, or broader questions about modeling data, please use our forum. Checkout the ecosystem for additional models powered by scvi-tools.

Introduction to scvi-tools

Go through the typical steps of an scvi-tools workflow

Quick start

Data loading and preparation

Load, preprocess, and register data for use with scvi-tools

Quick start

Atlas-level integration of lung data

Perform integration of multiple scRNA-seq datasets both with and without cell type annotation (scVI and scANVI)

scRNA-seq

MrVI Quick Start Tutorial

Analyze multi-sample scRNA-seq data with MrVI

scRNA-seq

Benchmarking the scANVI fix

Compare scANVI to other models following a bug fix in scvi-tools 1.1.0

scRNA-seq

Seed labeling with scANVI

Create seed labels and transfer cell type annotations to an entire dataset

scRNA-seq

Integration and label transfer with Tabula Muris

Perform de novo integration of a labeled reference dataset with an unlabeled query dataset (label transfer)

scRNA-seq

Differential expression on C. elegans data

Perform DE analysis on C. elegans data with scVI to quantify differences in gene expression between groups of cells

scRNA-seq

Annotation with CellAssign

Use CellAssign to assign cell types using only knowledge of marker genes

scRNA-seq

Isolating perturbation-induced variations with contrastiveVI

Use contrastiveVI to isolate perturbation-induced variation in Perturb-seq data

scRNA-seq

Linearly decoded VAE

Fit an LDVAE model to scRNA-seq data and interpret how genes are linked to latent variables of cells

scRNA-seq

Topic Modeling with Amortized LDA

Run the amortized Latent Dirichlet Allocation model in scvi-tools to learn topics of an scRNA-seq dataset

scRNA-seq

Identification of zero-inflated genes

Use the AutoZI model to enable gene-specific treatment of zero-inflation

scRNA-seq

Integration of scRNA-seq data with substantial batch effects using sysVI

Integrate scRNA-seq datasets with substantial batch effects.

scRNA-seq

Disentangled representation learning with DRVI

Learn an interpretable, disentangled representation of scRNA-seq data with DRVI and link latent dimensions to genes.

scRNA-seq

Decipher Quick Start Tutorial

Use Decipher to jointly analyze samples from distinct conditions.

scRNA-seq

Variational inference for RNA velocity with VeloVI

Use VeloVI to estimate RNA velocity.

scRNA-seq

Improving embeddings for low-count cells with JointEmbeddingSCVI

Use JointEmbeddingSCVI for improving SCVI for low-count cells through self-supervised augmentation

scRNA-seq

MrVI analysis over Tahoe100M cells dataset

Analyze Tahoe100M cells dataset with MrVI in PyTorch

scRNA-seq

PeakVI: Analyzing scATACseq data

Go through the PeakVI workflow to analyze a scATAC-seq dataset

ATAC-seq

PoissonVI: Analyzing quantitative scATAC-seq fragment counts

Go through the PoissonVI workflow to analyze scATAC-seq data using quantitative fragment counts

ATAC-seq

ScBasset: Analyzing scATACseq data

Go through the scBasset workflow to analyze a scATAC-seq dataset

ATAC-seq

scBasset: Batch correction of scATACseq data

Use scBasset to integrate data across several samples

ATAC-seq

Integrating single-cell methylation data from different scBS-seq experiments with methylVI

Correct batch effects in across different scBS-seq experiments with methylVI

scBS-seq

CITE-seq analysis with totalVI

Go through the totalVI workflow to analyze CITE-seq datasets

Multimodal

Reference mapping with SCVI-Tools

Map cells from a query dataset to the latent space of a reference dataset with the scArches method

Multimodal

CITE-seq reference mapping with totalVI

Use totalVI to train a reference model and map CITE-seq query data

Multimodal

Integration of CITE-seq and scRNA-seq data

Use totalVI to integrate CITE-seq and scRNA-seq datasets

Multimodal

Joint analysis of paired and unpaired multiomic data with MultiVI

Go through the MultiVI workflow to perform joint analysis of paired and unpaired multi omic data

Multimodal

Comparing integration metrics using scib-metrics package

Use TotalANVI to perform semi-supervised analysis of CITE-seq data, leveraging partial cell type annotations for label prediction, protein imputation, and differential abundance

Multimodal

Integration of scRNA-seq and spatial proteomics data with DiagVI

Perform integration of spatial proteomics and single-cell transcriptomics data with DiagVI

Multimodal

Integration of scRNA-seq and spatial transcriptomics data with DiagVI

Perform integration of spatial and single-cell transcriptomics data with DiagVI

Multimodal

ResolVI to address noise and biases in spatial transcriptomics

Use resolVI to correct cellular-resolved spatial transcriptomics data.

Spatial transcriptomics

scVIVA for representing cells and their environment in spatial transcriptomics

Stratify spatial transcriptomics data into niche-aware cell states with scVIVA

Spatial transcriptomics

Multi-resolution deconvolution of spatial transcriptomics

Perform multi-resolution analysis on spatial transcriptomics data with DestVI

Spatial transcriptomics

Introduction to gimVI

Use gimVI to impute missing genes in spatial data

Spatial transcriptomics

Spatial mapping with Tangram

Use Tangram to map spatial transcriptomics data

Spatial transcriptomics

Stereoscope applied to left ventricule data

Go through the Stereoscope workflow to map single-cell data

Spatial transcriptomics

Mapping human lymph node cell types to 10X Visium with Cell2location

Spatially map lymph node cell types using Cell2location

Spatial transcriptomics

Quick start tutorial for CytoVI

Correct batch effects in cytometry data experiments with cytoVI

Cytometry

Advanced Tutorial: Multi-Panel Integration and Downstream Analysis with CytoVI

Perform multi-panel integration and downstream advanced analysis with CytoVI

Cytometry

Using scvi-hub to download pretrained scvi-tools models

Learn how to use Hugging Face and scvi-hub to download pretrained scvi-tools models

Model hub

Using scvi-hub to upload pretrained scvi-tools models

Learn how to upload pretrained scvi-tools models to Hugging Face

Model hub

Use pretrained models of scVI-hub for CELLxGENE

Perform analysis of a CELLxGENE dataset using a pretrained model from scVI-hub

Model hub

Querying the Human Lung Cell Atlas

Use scANVI, scArches, and scvi-hub to query the Human Lung Cell Atlas

Model hub

Use pretrained models of scVI-hub for Tahoe100M

Query pre-trained SCVI model that was trained over Tahoe100M cells dataset and stored on hub

Model hub

Preprocessing datasets for analysis with scvi-tools

Learn how to preprocess various types of data for use with scvi-tools models.

Common Modelling Use Cases

Model hyperparameter tuning with scVI

Automatically find optimal set of hyperparameters using autotune.

Common Modelling Use Cases

Minification

Minify a dataset by replacing count data with the model’s estimated parameters of the latent posterior distribution

Common Modelling Use Cases

Using SHAP values and IntegratedGradients for cell type classification interpretability

Use integrated gradient or SHAP values for model explainability

Common Modelling Use Cases

Train a scVI model using multiGPU

Example of how to train an SCVI model using multi GPU settings

Common Modelling Use Cases

Using Python in R with `reticulate`

Perform basic Python operations in an R environment

R Tutorials

Introduction to scvi-tools in R

Go through the typical steps of an scvi-tools workflow in R

R Tutorials

Integrating datasets with scVI in R

Use basic scvi-tools functionality in R including integration of datasets

R Tutorials

CITE-seq analysis in R

Use scvi-tools functionality in R to analyze CITE-seq data

R Tutorials

ATAC-seq analysis in R

Use scvi-tools functionality in R to analyze scATAC-seq data

R Tutorials

Multi-resolution deconvolution of spatial transcriptomics in R

Use scvi-tools functionality in R to analyze spatial transcriptomics datasets

R Tutorials

Data handling in scvi-tools

Learn about how data is handled in scvi-tools

Development

Constructing a probabilistic module

Implement a novel statistical method for single-cell omics data as a module

Development

Constructing a high-level model

Implement an scvi-tools model class to provide a convenient interface for the lower-level module objects

Development

Train a scVI model using Census data

Learn a scalable approach using TileDBDataModule dataloader to training an scVI model on Census data.

Custom Data Loaders

Train a scVI model using Lamin

Use the Lamin MappedCollectionDataModule for a scalable approach to training an scVI model on multiple adata's.

Custom Data Loaders

Train a scVI model using Anncollection dataloader wrapper

Use the AnnCollection dataloader for a scalable approach to training an scVI model on multiple adata's.

Custom Data Loaders

Introduction to scvi-tools with Annbatch and Rapids-singlecell

Use annbatch for AnnData-native disk-backed training with scvi-tools models.

Custom Data Loaders

Notebook not found, path tried: /home/docs/checkouts/readthedocs.org/user_builds/scvi/checkouts/stable/docs/tutorials/notebooks/custom_dl/Tahoe100_mrVI_lamin.ipynb

Use the Lamin Custom Dataloader to train mrVI torch model over Tahoe100M cells dataset

Custom Data Loaders