Hypergraph Representations of scRNA-seq Data for Improved Clustering with Random Walks
Wan He, Daniel I. Bolnick, Samuel V. Scarpino, and Tina Eliassi-Rad

TL;DR
This paper introduces hypergraph-based methods for analyzing scRNA-seq data, capturing multi-way relationships and improving clustering performance over traditional network approaches, especially in weakly modular data.
Contribution
The paper proposes two novel hypergraph walk algorithms, DIPHW and CoMem-DIPHW, that outperform existing methods by leveraging higher-order relationships in scRNA-seq data.
Findings
Hypergraph methods outperform traditional networks in clustering accuracy.
CoMem-DIPHW effectively integrates multiple data representations.
Significant improvements in weakly modular datasets.
Abstract
Analysis of single-cell RNA sequencing data is often conducted through network projections such as coexpression networks, primarily due to the abundant availability of network analysis tools for downstream tasks. However, this approach has several limitations: loss of higher-order information, inefficient data representation caused by converting a sparse dataset to a fully connected network, and overestimation of coexpression due to zero-inflation. To address these limitations, we propose conceptualizing scRNA-seq expression data as hypergraphs, which are generalized graphs in which the hyperedges can connect more than two vertices. In the context of scRNA-seq data, the hypergraph nodes represent cells and the edges represent genes. Each hyperedge connects all cells where its corresponding gene is actively expressed and records the expression of the gene across different cells. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Single-cell and spatial transcriptomics
