# Lorentz-regularized interpretable VAE for multi-scale single-cell transcriptomic and epigenomic embeddings

**Authors:** Zeyu Fu, Jiawei Fu, Chunlin Chen, Keyang Zhang, Song Wang

PMC · DOI: 10.3389/fgene.2025.1713727 · Frontiers in Genetics · 2026-01-05

## TL;DR

This paper introduces LiVAE, a new method for analyzing single-cell data that balances detailed local patterns with global structure using hyperbolic geometry.

## Contribution

The novel use of hyperbolic geometry as soft regularization in a dual-pathway VAE resolves the local–global trade-off in single-cell data.

## Key findings

- LiVAE achieves superior global topology preservation and latent geometry across 135 datasets.
- The method shows enhanced robustness to noise and improved embedding quality in UMAP and t-SNE visualizations.
- Biologically meaningful latent axes were revealed in a Dapp1 perturbation dataset.

## Abstract

Single-cell multi-omics technologies capture cellular heterogeneity at unprecedented resolution, yet dimensionality reduction methods face a fundamental local–global trade-off: approaches optimized for local neighborhood preservation distort global topology, while those emphasizing global coherence obscure fine-grained cell states.

We introduce the Lorentz-regularized variational autoencoder (LiVAE), a dual-pathway architecture that applies hyperbolic geometry as soft regularization over standard Euclidean latent spaces. A primary encoding pathway preserves local transcriptional details for high-fidelity reconstruction, while an information bottleneck (BN) pathway extracts global hierarchical structure by filtering technical noise. Lorentzian distance constraints enforce geometric consistency between pathways in hyperbolic space, enabling LiVAE to balance local fidelity with global coherence without requiring specialized batch-correction procedures. Systematic benchmarking across 135 datasets against 21 baseline methods demonstrated that LiVAE achieves superior global topology preservation (distance correlation gains: 0.209–0.436), richer latent geometry (manifold dimensionality: 0.123–0.467; participation ratio: 0.149–0.761), and enhanced robustness (noise resilience: 0.184–0.712) while maintaining competitive local fidelity. The overall embedding quality improved by 0.051–0.284 across uniform manifold approximation and projection (UMAP) and t-distributed stochastic neighbor embedding (t-SNE) visualizations. Component-wise interpretability analysis on a Dapp1 perturbation dataset revealed biologically meaningful latent axes.

LiVAE provides a robust, general-purpose framework for single-cell representation learning that resolves the local–global trade-off through geometric regularization. By maintaining Euclidean latent spaces while leveraging hyperbolic priors, LiVAE enables improved developmental trajectory inference and mechanistic biological discovery without sacrificing compatibility with existing computational ecosystems.

## Linked entities

- **Genes:** DAPP1 (dual adaptor of phosphotyrosine and 3-phosphoinositides 1) [NCBI Gene 27071]

## Full-text entities

- **Genes:** DAPP1 (dual adaptor of phosphotyrosine and 3-phosphoinositides 1) [NCBI Gene 27071] {aka BAM32}

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12812404/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12812404/full.md

## References

63 references — full list in the complete paper: https://tomesphere.com/paper/PMC12812404/full.md

---
Source: https://tomesphere.com/paper/PMC12812404