# AugGCL: Multimodal graph learning for spatial transcriptomics analysis with enhanced gene and morphological data

**Authors:** Tengfei Ji, Bo Yang, Meng Wang, Hong Ji, Huazhe Yang, Yizhuo Liu

PMC · DOI: 10.1371/journal.pcbi.1013912 · PLOS Computational Biology · 2026-01-23

## TL;DR

AugGCL is a new method for spatial transcriptomics that improves the accuracy of identifying tissue structures by combining gene expression and morphological data.

## Contribution

AugGCL introduces a multimodal graph learning framework that integrates gene and image data to enhance spatial domain identification.

## Key findings

- AugGCL outperforms baseline methods on multiple datasets, including human brain, breast cancer, and mouse embryo.
- The method improves spatial domain recognition accuracy by integrating gene expression with tissue morphology.
- Downstream analyses validated the method's effectiveness in cell annotation and functional enrichment.

## Abstract

Spatial transcriptomics enables the measurement of gene expression in intact tissues. Despite this, reconstructing anatomically accurate spatial domains remains challenging, primarily due to expression sparsity, complex tissue architecture that is characterized by sharp boundaries and long-range continuity, and weak spatial signals. Traditional pipelines typically rely on expression-driven clustering and spatial smoothing, which underperform at boundaries and in sparse regions while neglecting morphological information. To address these challenges, AugGCL is proposed, an augmented graph-convolutional learning framework that enhances spatial structure decoding and gene expression reconstruction through targeted augmentation of both gene and image data. A key component of AugGCL is neighborhood information aggregation mechanism, which integrates expression similarity and spatial proximity to construct a weighted graph and an enhanced expression matrix, addressing sparsity without sacrificing boundary clarity. Additionally, a two stream weighted graph convolutional network jointly models refined gene features and image-derived morphological information, with image-aware auxiliary reconstructions enhancing weak spatial signals and sharpening boundaries. On datasets from the human dorsolateral prefrontal cortex, breast cancer, and mouse embryo, AugGCL outperforms baseline methods across multiple metrics, showing robustness and generalization across a range of datasets. Downstream analysis validated the reliability of the method, confirming its effectiveness in cell annotation, functional enrichment, and mechanistic studies. AugGCL generates clearer spatial domains and significantly advances the application of spatial transcriptomics in tissue structure and disease research.

Spatial transcriptomics is an important technique for revealing tissue structure and disease mechanisms. However, existing spatial domain identification methods have not fully exploited the spatial information embedded in the data, especially when it comes to detailed exploration of tissue structure. This study presents a new tool for spatial domain identification, which makes full use of various types of spatial transcriptomic data from multiple perspectives to accurately identify real cell groupings. The method significantly improves spatial domain recognition accuracy by integrating gene expression with tissue morphology analysis. Experimental results show that this tool not only accurately identifies spatial domains but also provides strong support for the comprehensive exploration of biological tissue structure. On this basis, a series of downstream analyses, including volcano plot generation, functional enrichment analysis, and gene heatmap visualization, were performed. These analyses not only validated the effectiveness of the method but also revealed the functional characteristics and expression patterns of cells within spatial domains. This step further confirmed the broad application potential of the method in cell type annotation, functional enrichment, and mechanistic research, highlighting its significant potential in advancing biological and disease research.

## Linked entities

- **Diseases:** breast cancer (MONDO:0004989)
- **Species:** Homo sapiens (taxon 9606), Mus musculus (taxon 10090)

## Full-text entities

- **Genes:** CXCL14 (C-X-C motif chemokine ligand 14) [NCBI Gene 9547] {aka BMAC, BRAK, KEC, KS1, MIP-2g, MIP2G}, RELN (reelin) [NCBI Gene 5649] {aka ETL7, LIS2, PRO1598, RL}, HPCAL1 (hippocalcin like 1) [NCBI Gene 3241] {aka BDR1, HLP2, VILIP-3}, MBP (myelin basic protein) [NCBI Gene 4155], CNN3 (calponin 3) [NCBI Gene 1266], Myh7 (myosin, heavy polypeptide 7, cardiac muscle, beta) [NCBI Gene 140781] {aka B-MHC, MYH-beta/slow, MyHC-I, Myhc-b, Myhcb, beta-MHC}, PCP4 (Purkinje cell protein 4) [NCBI Gene 5121] {aka PEP-19}, MGP (matrix Gla protein) [NCBI Gene 4256] {aka GIG36, MGLAP, NTI}, PTN (pleiotrophin) [NCBI Gene 5764] {aka HARP, HB-GAM, HBBM, HBGF-8, HBGF8, HBNF}, VAMP1 (vesicle associated membrane protein 1) [NCBI Gene 6843] {aka CMS25, SAX1, SPAX1, SYB1, VAMP-1}, KLK6 (kallikrein related peptidase 6) [NCBI Gene 5653] {aka Bssp, Klk7, PRSS18, PRSS9, SP59, hK6}, SERPINA3 (serpin family A member 3) [NCBI Gene 12] {aka AACT, ACT, GIG24, GIG25}, KRT17 (keratin 17) [NCBI Gene 3872] {aka 39.1, CK-17, K17, PC2, PCHC1}
- **Diseases:** metastasis (MESH:D009362), Ductal Carcinoma (MESH:D044584), breast cancer (MESH:D001943), LCIS (MESH:D000071960), DCIS (MESH:D002285), tumor (MESH:D009369)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12863693/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12863693/full.md

## References

47 references — full list in the complete paper: https://tomesphere.com/paper/PMC12863693/full.md

---
Source: https://tomesphere.com/paper/PMC12863693