# scDBic: a novel deep learning-based biclustering algorithm for analyzing scRNA-seq data

**Authors:** Xiaoqi Tang, Caihua Liu, Chaowang Lan

PMC · DOI: 10.1093/bioinformatics/btag095 · Bioinformatics · 2026-02-26

## TL;DR

scDBic is a new deep learning method for analyzing single-cell RNA data that better identifies cell groups and their key genes.

## Contribution

scDBic introduces a novel deep learning-based biclustering algorithm with improved cell clustering and key gene identification.

## Key findings

- scDBic improves cell clustering performance compared to traditional and biclustering algorithms.
- The method identifies key genes for each cell group using a reverse strategy.
- The algorithm is freely available and outperforms existing techniques in scRNA-seq analysis.

## Abstract

Clustering single-cell RNA sequencing (scRNA-seq) data plays a vital role in the study of cellular heterogeneity. Many algorithms have been developed to cluster scRNA-seq data. However, traditional clustering algorithms often fail to capture local consistency, whereas biclustering algorithms suffer from issues such as cell loss, poor adaptability to high-dimensional data, and iterative selection challenges.

In this paper, we introduce scDBic, a novel deep learning-based biclustering algorithm specialized for scRNA-seq data. It comprises three main steps: cell clustering with a deep autoencoder, gene clustering, and identification of key gene clusters using the reverse strategy. The key idea is that the deep autoencoder captures the main information of gene expression and the reverse strategy identifies the key genes of cell groups. Therefore, cell clustering performance can be improved. The results demonstrate that our algorithm not only discovers cell groups in scRNA-seq data but also identifies the key genes of the cell groups. Furthermore, the clustering performance of our algorithm is better than that of traditional clustering and biclustering algorithms. This novel technique can be directly applied to discover cell groups and identify key genes in cell groups.

The source code and test data are freely available at GitHub (https://github.com/Xiaoqi-Tang/scDBic) and archived on Zenodo (DOI: 10.5281/zenodo.18676401).

## Full-text entities

- **Genes:** Cdx2 (caudal type homeobox 2) [NCBI Gene 12591] {aka Cdx-2}, H3c7 (H3 clustered histone 7) [NCBI Gene 260423] {aka H3.2-221, H3c13, H3c14, H3c15, H3c2, H3c3}, Sox21 (SRY (sex determining region Y)-box 21) [NCBI Gene 223227] {aka Sox25}, Carm1 (coactivator-associated arginine methyltransferase 1) [NCBI Gene 59035] {aka Prmt4, m9Bei}, CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, Pou5f1 (POU domain, class 5, transcription factor 1) [NCBI Gene 18999] {aka NF-A3, Oct-3, Oct-3/4, Oct-4, Oct3, Oct3/4}
- **Diseases:** autoimmune lymphoproliferative syndrome (MESH:D056735), NMI (MESH:C537354), SNN (MESH:D012753), ARI (MESH:D000275)
- **Chemicals:** E-MTAB-3321 (-)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** E-MTAB-3321 — Homo sapiens (Human), Primary peritoneal serous papillary adenocarcinoma, Cancer cell line (CVCL_9Y93)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13012890/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13012890/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/PMC13012890/full.md

---
Source: https://tomesphere.com/paper/PMC13012890