Generalized Bayesian nonparametric clustering framework for high-dimensional spatial omics data
Bencong Zhu, Guanyu Hu, Xiaodan Fan, and Qiwei Li

TL;DR
This paper introduces a Bayesian nonparametric clustering method for high-dimensional spatial transcriptomics data that automatically infers the number of spatial domains while preserving spatial information.
Contribution
It presents a novel BNPMFA model that integrates spatial constraints and infers the optimal number of clusters, overcoming limitations of existing multi-stage approaches.
Findings
Outperforms state-of-the-art methods in clustering accuracy
Accurately estimates the number of spatial domains
Provides new insights into cellular region identification
Abstract
The advent of next-generation sequencing-based spatially resolved transcriptomics (SRT) techniques has transformed genomic research by enabling high-throughput gene expression profiling while preserving spatial context. Identifying spatial domains within SRT data is a critical task, with numerous computational approaches currently available. However, most existing methods rely on a multi-stage process that involves ad-hoc dimension reduction techniques to manage the high dimensionality of SRT data. These low-dimensional embeddings are then subjected to model-based or distance-based clustering methods. Additionally, many approaches depend on arbitrarily specifying the number of clusters (i.e., spatial domains), which can result in information loss and suboptimal downstream analysis. To address these limitations, we propose a novel Bayesian nonparametric mixture of factor analysis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification
