# Bayesian inference of RNA velocity incorporating timepoints, lineage bifurcations, and count data

**Authors:** Yichen Gu, Yuxuan Song, David Blaauw, Joshua D. Welch, Ilya Ioshikhes, Wei Li, Ilya Ioshikhes, Wei Li, Ilya Ioshikhes, Wei Li

PMC · DOI: 10.1371/journal.pcbi.1014060 · PLOS Computational Biology · 2026-03-20

## TL;DR

VeloVAE is a new Bayesian method that improves the accuracy of tracking gene expression changes in cells over time, even when data is limited to single snapshots.

## Contribution

VeloVAE introduces a Bayesian framework for RNA velocity that models lineage bifurcations, discrete count data, and latent time with improved accuracy.

## Key findings

- VeloVAE outperforms existing methods in reconstructing gene expression dynamics and estimating transcription rates.
- The model accurately captures complex biological systems like hematopoiesis and brain development.
- Automatically inferred latent time performs better than manually defined pseudotime in some cases.

## Abstract

Experimental approaches for measuring single-cell gene expression can observe each cell at only one time point, requiring computational approaches for reconstructing the dynamics of gene expression during cell fate transitions. RNA velocity is a promising computational approach for this problem, but existing inference methods fail to capture key aspects of real data, limiting their utility. To address these limitations, we developed VeloVAE, a Bayesian model for RNA velocity inference. VeloVAE uses variational Bayesian inference to estimate the posterior distribution of latent time, latent cell state, and kinetic rate parameters for each cell. Our approach can incorporate prior distributions on rate parameters and time points; model lineage bifurcations using branching differential equations; and directly model discrete count data. We show that VeloVAE significantly outperforms previous approaches in terms of data fit, accuracy of inferred differentiation directions, and transcription rate estimation. These improvements allow VeloVAE to accurately model gene expression dynamics in complex biological systems, including hematopoiesis, induced pluripotent stem cell reprogramming, the developing mouse brain, and the entire mouse embryo. We find that the latent time automatically inferred using all cells can even outperform pseudotime inferred using manually chosen cell subsets and root cells. Our work provides important new tools for modeling sequential changes in gene expression from single-cell expression data.

Understanding how cells change their identity over time is a central question in biology. However, most experiments that measure gene activity in individual cells can only capture a single snapshot of each cell, making it difficult to see how cells move through developmental or disease-related processes. In our work, we developed a computational method VeloVAE to reconstruct these cellular trajectories from scRNA-seq data. Our approach uses a Bayesian framework to infer the latent time, cell state and the rates at which genes turn on or off. By integrating biological information and modeling complex processes such as cell type bifurcation, VeloVAE provides a more accurate and interpretable view of cell development than previous approaches. We demonstrate that VeloVAE can reveal key patterns of gene activity across several biological systems, including blood formation, brain development, and stem cell reprogramming. Overall, our work provides a general framework for understanding dynamic gene regulation from single-cell data.

## Linked entities

- **Species:** Mus musculus (taxon 10090)

## Full-text entities

- **Genes:** Cnn2 (calponin 2) [NCBI Gene 12798] {aka Calpo2}, CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, Krt7 (keratin 7) [NCBI Gene 110310] {aka D15Wsu77e, K7, Krt2-7}, CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, FCGR3A (Fc gamma receptor IIIa) [NCBI Gene 2214] {aka CD16-II, CD16A, FCG3, FCGR3, FCRIIIA, FcGRIIIA}, SMOC1 (SPARC related modular calcium binding 1) [NCBI Gene 64093] {aka OAS}, Hsp90aa1 (heat shock protein 90, alpha (cytosolic), class A member 1) [NCBI Gene 15519] {aka 86kDa, 89kDa, Hsp86-1, Hsp89, Hsp90, Hspca}, NNAT (neuronatin) [NCBI Gene 4826] {aka Peg5}, NEUROG3 (neurogenin 3) [NCBI Gene 50674] {aka Atoh5, Math4B, NGN-3, bHLHa7, ngn3}, Ppp1r1a (protein phosphatase 1, regulatory inhibitor subunit 1A) [NCBI Gene 58200] {aka 0610038N18Rik, I-1}, CD14 (CD14 molecule) [NCBI Gene 929], Atp1a2 (ATPase, Na+/K+ transporting, alpha 2 polypeptide) [NCBI Gene 98660] {aka Atpa-3, mKIAA0778}, Nr2f1 (nuclear receptor subfamily 2, group F, member 1) [NCBI Gene 13865] {aka COUP-TF1, COUP-TFI, COUPTFA, EAR-3, EAR3, Erbal3}
- **Diseases:** CBDir (MESH:C537866), IPS (MESH:C536271), cancer (MESH:D009369), PCA (MESH:C562643)
- **Chemicals:** CBDir (-), PNAS (MESH:D020135)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13021174/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13021174/full.md

## References

52 references — full list in the complete paper: https://tomesphere.com/paper/PMC13021174/full.md

---
Source: https://tomesphere.com/paper/PMC13021174