Functional modules from variable genes: Leveraging percolation to analyze noisy, high-dimensional data
Steffen Werner, W Mathijs Rozemuller, Annabel Ebbing, Anna Alemany,, Joleen Traets, Jeroen S. van Zon, Alexander van Oudenaarden, Hendrik C., Korswagen, Greg J. Stephens, and Thomas S. Shimizu

TL;DR
This paper introduces a novel density-based clustering method leveraging percolation theory to identify functionally related gene groups in noisy, high-dimensional gene expression data, effectively distinguishing biological signals from noise.
Contribution
The authors develop a new clustering approach that exploits percolation transitions to robustly detect correlated gene modules in noisy datasets, applicable beyond gene expression.
Findings
Successfully applied to single-cell and whole-organism RNA-seq data
Revealed gene clusters related to cell cycle, development, and tissue functions
Demonstrated robustness and versatility across diverse biological datasets
Abstract
While measurement advances now allow extensive surveys of gene activity (large numbers of genes across many samples), interpretation of these data is often confounded by noise -- expression counts can differ strongly across samples due to variation of both biological and experimental origin. Complimentary to perturbation approaches, we extract functionally related groups of genes by analyzing the standing variation within a sampled population. To distinguish biologically meaningful patterns from uninterpretable noise, we focus on correlated variation and develop a novel density-based clustering approach that takes advantage of a percolation transition generically arising in random, uncorrelated data. We apply our approach to two contrasting RNA sequencing data sets that sample individual variation -- across single cells of fission yeast and whole animals of C. elegans worms -- and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Gene Regulatory Network Analysis · Evolution and Genetic Dynamics
