# scSCCNIA: similarity matrix based contrastive clustering with neighbor information aggregation for single-cell RNA sequencing data

**Authors:** Jing Wang, Junfeng Xia, Yansen Su, Chun-Hou Zheng

PMC · DOI: 10.1093/bib/bbag094 · 2026-03-05

## TL;DR

This paper introduces scSCCNIA, a new method for analyzing single-cell RNA sequencing data to better identify cell types and their functions.

## Contribution

The novel scSCCNIA framework uses similarity-matrix-based contrastive learning and neighbor information aggregation for improved cell clustering.

## Key findings

- scSCCNIA outperforms existing methods in cell clustering and marker gene identification.
- The method reveals cell type heterogeneity and functional specificity through enrichment analyses.

## Abstract

The development of single-cell RNA sequencing (scRNA-seq) technology provides unprecedented opportunities for elucidating cell heterogeneity and gene expression. Identifying and discovering cell types through cell clustering is a crucial step in analyzing scRNA-seq data. However, the high-dimensionality nature and frequent dropout events of the data raise great challenges for cell clustering. Here, we propose a novel contrastive clustering framework called scSCCNIA (Similarity-matrix-based Contrastive Clustering with Neighbor Information Aggregation), for the accurate identification of cell clusters from scRNA-seq data. scSCCNIA adopts a Laplacian filter to conduct neighbor information aggregation, constructs different graph views by using special un-shared parameters Siamese encoders for data augmentation, and learns the latent low-dimensional embedding representations via similarity-matrix-based contrastive learning. Comparative analyses of multiple scRNA-seq datasets from different platforms and with varying cell numbers demonstrate that scSCCNIA outperforms existing methods in terms of cell clustering and marker gene identification. Furthermore, scSCCNIA reveals the heterogeneity and functional specificity of various cell types through Gene Ontology terms and Kyoto Encyclopedia of Genes and Genomes enrichment analyses. Overall, scSCCNIA is an effective algorithm for learning latent features from scRNA-seq data, enhancing cell type identification accuracy and facilitating downstream analyses of scRNA-seq data.

## Full-text entities

- **Diseases:** ARI (MESH:D000275), Coronavirus disease_COVID_19 (MESH:D000086382), tumor (MESH:D009369)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** Camp1 — Homo sapiens (Human), Transformed cell line (CVCL_YI91)

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12962064/full.md

---
Source: https://tomesphere.com/paper/PMC12962064