# Iterative clustering algorithm G-DESC-E and pan-cancer key gene analysis based on single-cell sequencing data

**Authors:** Ke Wu, Changming Sun, Jie Geng, Ping Wang, Qi Dai, Leyi Wei, Ran Su

PMC · DOI: 10.1093/bib/bbaf288 · Briefings in Bioinformatics · 2025-07-03

## TL;DR

This paper introduces a new clustering algorithm, G-DESC-E, and uses single-cell sequencing data to identify key genes involved in various cancer types.

## Contribution

The novel G-DESC-E algorithm improves clustering accuracy and enables pan-cancer key gene analysis using single-cell data.

## Key findings

- The G-DESC-E algorithm effectively enhances clustering accuracy in single-cell data.
- Over thirty genes potentially related to cancer occurrence and progression were identified.
- The methodology successfully distinguishes transcriptional features across different cancer subtypes.

## Abstract

Single-cell sequencing technology has profoundly revolutionized the field of cancer genomics, enabling researchers to explore gene expression profiles at the resolution of individual cells. Despite its extensive applications in the study of cancer gene states, pan-cancer analyses remain relatively underexplored. In this study, we propose the G-DESC-E algorithm, which effectively distinguishes dimensionality-reduced data through a grid-based approach, filters out outliers during the preprocessing phase, and employs the Louvain algorithm for prescreening cluster centroids as initial clusters. We construct an objective function by integrating label entropy with the Kullback–Leibler divergence formula, achieving final clustering results through iterative optimization. Our findings demonstrate the effectiveness of the G-DESC-E algorithm in enhancing clustering accuracy. By applying our methodology to real-world datasets, we illustrate its capability to identify critical transcriptional features associated with distinct cancer subtypes. Coupled with clustering visualization and gene ontology analysis, we identify over thirty genes potentially related to cancer occurrence and progression. The algorithm and research framework presented in this study pave the way for new directions in clinical research by applying single-cell sequencing technology to the analysis of key genes within the realm of pan-cancer analysis for the first time. This approach offers valuable insights that can inform further clinical investigations.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Diseases:** cancer (MESH:D009369)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12224615/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12224615/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/PMC12224615/full.md

---
Source: https://tomesphere.com/paper/PMC12224615