User guide

User guide#

scvi-tools is composed of models that can perform one or many analysis tasks. In the user guide, we provide an overview of each model with emphasis on the math behind the model, how it connects to the code, and how the code connects to analysis.

scRNA-seq analysis#

Model	Tasks	Reference
scVI	Dimensionality reduction, removal of unwanted variation, integration across replicates, donors, and technologies, differential expression, imputation, normalization of other cell- and sample-level confounding factors	[Lopez et al., 2018]
scANVI	scVI tasks with cell type transfer from reference, seed labeling	[Xu et al., 2021]
LDVAE	scVI tasks with linear decoder	[Svensson et al., 2020]
AUTOZI	for assessing gene-specific levels of zero-inflation in scRNA-seq data	[Clivio et al., 2019]
CellAssign	Marker-based automated annotation	[Zhang et al., 2019]
Solo	Doublet detection	[Bernstein et al., 2020]
scAR	Ambient RNA removal	[Sheng et al., 2022]
contrastiveVI	scVI tasks with contrastive analysis	[Weinberger et al., 2023]
MrVI	Characterization of sample-level heterogeneity	[Boyeau et al., 2025]
SysVI	Integrating single-cell RNA-seq datasets with substantial batch effects	[Hrovatin et al., 2023]
Decipher	Joint representation and visualization of derailed cell states with Decipher	[Nazaret et al., 2024]
VeloVI	Deep generative modeling of transcriptional dynamics for RNA velocity analysis in single cells	[Gayoso et al., 2023]
DRVI	Unsupervised deep disentangled representation learning of single-cell omics	[Moinfar and Theis, 2024]
Joint-Embedding SCVI	Improving SCVI for low-count cells through self-supervised augmentation	[Svensson, 2026]

ATAC-seq analysis#

Model	Tasks	Reference
PeakVI	Dimensionality reduction, removal of unwanted variation, integration across replicates, donors, and technologies, differential expression, imputation, normalization of other cell- and sample-level confounding factors	[Ashuach et al., 2022]
scBasset	Representation learning on scATAC-seq data, integration of data across several samples	[Yuan and Kelley, 2022]
PoissonVI	Analyzing scATAC-seq data using quantitative fragment counts	[Martens et al., 2023]

BS-seq analysis#

Model	Tasks	Reference
MethylVI	Anlaysis of single-cell bisulfite data from several sequencing platforms	[Weinberger et al., 2026]
MethylANVI	MethylVI tasks along with cell type label transfer from reference, seed labeling	[Weinberger et al., 2026]

Cytometry analysis#

Model	Tasks	Reference
CytoVI	Correct batch effects, perform integration and downstream analysis in cytometry data	[Ingelfinger et al., 2025]

Multimodal analysis#

CITE-seq#

Model	Tasks	Reference
totalVI	Dimensionality reduction, removal of unwanted variation, integration across replicates, donors, and technologies, differential expression, protein imputation, imputation, normalization of other cell- and sample-level confounding factors	[Gayoso et al., 2021]
TotalANVI	A probabilistic generative model for single-cell RNA and CITE-seq protein data that integrates semi-supervised cell type annotations to jointly infer both protein expression and cell states	[]

Multiome#

Model	Tasks	Reference
MultiVI	Integration of paired/unpaired multiome data, missing modality imputation, normalization of other cell- and sample-level confounding factors	[Ashuach et al., 2023]
DiagVI	Diagonal integration of unpaired multiome data, dimensionality reduction, cross-modality imputation, cell label transfer

Spatial transcriptomics analysis#

Model	Tasks	Reference
DestVI	Multi-resolution deconvolution, cell-type-specific gene expression imputation, comparative analysis	[Lopez et al., 2022]
Stereoscope	Deconvolution	[Andersson et al., 2020]
gimVI	Imputation of missing spatial genes	[Lopez et al., 2019]
Tangram	Deconvolution, single cell spatial mapping	[Biancalani et al., 2021]
ResolVI	Generative model of single-cell resolved spatial transcriptomics	[Ergen and Yosef, 2025]
scVIVA	Representation of cells and their environments in spatial transcriptomics	[Levy et al., 2025]

General purpose analysis#

Model	Tasks	Reference
Amortized LDA	Topic modeling	[Blei et al., 2003]
Scvi-Hub	Scvi-hub: an actionable repository for model-driven single-cell analysis usign Hugging Face Hub	[Ergen et al., 2025]

Glossary#

Model

A Model class inherits BaseModelClass and is the user-facing object for interacting with a module. The model has a train method that learns the parameters of the module, and also contains methods for users to retrieve information from the module, like the latent representation of cells in a VAE. Conventionally, the post-inference model methods should not store data into the AnnData object, but instead return “standard” Python objects, like numpy arrays or pandas dataframes.

Module

A module is the lower-level object that defines a generative model and inference scheme. A module will either inherit BaseModuleClass or PyroBaseModuleClass. Consequently, a module can either be implemented with PyTorch alone, or Pyro. In the PyTorch only case, the generative process and inference scheme are implemented respectively in the generative and inference methods, while the loss method computes the loss, e.g, ELBO in the case of variational inference.

TrainingPlan

The training plan is a PyTorch Lightning Module that is initialized with a scvi-tools module object. It configures the optimizers, defines the training step and validation step, and computes metrics to be recorded during training. The training step and validation step are functions that take data, run it through the model and return the loss, which will then be used to optimize the model parameters in the Trainer. Overall, custom training plans can be used to develop complex inference schemes on top of modules.

Trainer

The Trainer is a lightweight wrapper of the PyTorch Lightning Trainer. It takes as input the training plan, a training data loader, and a validation dataloader. It performs the actual training loop, in which parameters are optimized, as well as the validation loop to monitor metrics. It automatically handles moving data to the correct device (CPU/GPU).