# scGACL: a generative adversarial network with multi-scale contrastive learning for accurate single-cell RNA sequencing imputation

**Authors:** Yanlin Jiang, Mengyuan Zhao, Jiahui Yan, Jijun Tang, Fei Guo

PMC · DOI: 10.1093/bib/bbag018 · Briefings in Bioinformatics · 2026-02-03

## TL;DR

This paper introduces scGACL, a new method using GANs and contrastive learning to improve single-cell RNA sequencing data imputation while preserving cell heterogeneity.

## Contribution

scGACL integrates GANs with multi-scale contrastive learning to address over-smoothing in single-cell RNA imputation.

## Key findings

- scGACL outperforms existing methods in recovering gene expression from single-cell RNA data.
- The method preserves both fine-grained and macroscopic biological variations in imputed data.
- It improves downstream analyses like cell clustering and trajectory inference.

## Abstract

Single-cell RNA sequencing is a powerful technology for investigating cell-to-cell heterogeneity, yet its application is often hindered by dropout events, making accurate imputation essential for downstream analyses. Existing imputation methods, however, frequently suffer from the over-smoothing problem, which results in the loss of cell-to-cell heterogeneity in the imputed outcomes and affects downstream analyses. To overcome this limitation, we propose scGACL, a generative adversarial network (GAN) integrated with multi-scale contrastive learning. The GAN architecture facilitates the distribution of the imputed data to approximate that of the real data. To fundamentally address over-smoothing, the model incorporates a multi-scale contrastive learning mechanism: cell-level contrastive learning preserves fine-grained cell-to-cell heterogeneity, while cell-type-level contrastive learning maintains macroscopic biological variation across different cellular groups. These mechanisms function synergistically to ensure accurate imputation and effectively address the over-smoothing challenge. Comprehensive evaluations across diverse simulated and real-world datasets confirm that scGACL consistently outperforms existing methods in accurately recovering gene expression and improving downstream analyses such as cell clustering, gene differential expression analysis, and cell trajectory inference.

## Full-text entities

- **Diseases:** lung adenocarcinoma (MESH:D000077192), ARI (MESH:D000275), lipoma (MESH:D008067)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]
- **Cell lines:** H1975 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_1511), H2228 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_1543), sc_10x — Homo sapiens (Human), Follicular lymphoma, Cancer cell line (CVCL_1888), HCC827 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_2063)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12866930/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12866930/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/PMC12866930/full.md

---
Source: https://tomesphere.com/paper/PMC12866930