# VARGG: a deep learning framework advancing precise spatial domain identification and cellular heterogeneity analysis in spatial transcriptomics

**Authors:** Mengqiu Wang, Zhiwei Zhang, Lixin Lei, Kaitai Han, Zhenghui Wang, Ruoyan Dai, Zijun Wang, Chaojing Shi, Xudong Zhao, Qianjin Guo

PMC · DOI: 10.1093/bfgp/elaf018 · Briefings in Functional Genomics · 2025-11-23

## TL;DR

VARGG is a deep learning framework that improves the accuracy of identifying spatial domains and analyzing cellular diversity in spatial transcriptomics data.

## Contribution

VARGG introduces a novel combination of Vision Transformer and graph neural networks for enhanced spatial domain identification.

## Key findings

- VARGG accurately delineates spatial domains across multiple spatial transcriptomics platforms.
- The framework improves feature representation and model generalizability using gated residual graph networks and Gaussian noise.
- VARGG helps identify molecular markers and therapeutic targets in diseases like glioblastoma and breast cancer.

## Abstract

Spatial transcriptomics has revolutionized our ability to measure gene expression while preserving spatial information, thus facilitating detailed analysis of tissue structure and function. Identifying spatial domains accurately is key for understanding tissue microenvironments and biological progression. To overcome the challenge of integrating gene expression data with spatial information, we introduce the VARGG deep learning framework. VARGG combines a pretrained Vision Transformer (ViT) with a graph neural network autoencoder, utilizing ViT’s self-attention mechanism to capture global contextual information and enhance understanding of spatial relationships. This framework is further enhanced by multi-layer gated residual graph neural networks and Gaussian noise, which improve feature representation and model generalizability across different data sources. The robustness and scalability of VARGG have been verified on different platforms (10x Visium, Slide-seqV2, Stereo-seq, and MERFISH) and datasets of different sizes (human glioblastoma, mouse embryo, breast cancer). Our results demonstrate that VARGG’s ability to accurately delineate spatial domains can provide a deeper understanding of tissue structure and help identify key molecular markers and potential therapeutic targets, thereby improving our understanding of disease mechanisms and providing opportunities for personalization to inform the development of treatment strategies.

## Linked entities

- **Diseases:** glioblastoma (MONDO:0018177), breast cancer (MONDO:0004989)
- **Species:** Mus musculus (taxon 10090)

## Full-text entities

- **Diseases:** glioblastoma (MESH:D005909), breast cancer (MESH:D001943)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12640549/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12640549/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/PMC12640549/full.md

---
Source: https://tomesphere.com/paper/PMC12640549