Two-Tier Mapper: a user-independent clustering method for global gene expression analysis based on topology
Rachel Jeitziner, Mathieu Carri\`ere, Jacques Rougemont and, Steve Oudot, Kathryn Hess, Cathrin Brisken

TL;DR
Two-Tier Mapper (TTMap) is an automated, topology-based clustering tool for gene expression data that detects subgroups and features with high sensitivity and stability, suitable for personalized medicine.
Contribution
TTMap introduces a novel, user-independent topological clustering method that adjusts for variability and outliers, outperforming existing methods in sensitivity and stability.
Findings
Outperforms current clustering methods in sensitivity and stability
Detects previously undetected gene expression changes in biological samples
Remains robust against sample removal, normalization, and data subselection
Abstract
There is a growing need for unbiased clustering methods, ideally automated. We have developed a topology-based analysis tool called Two-Tier Mapper (TTMap) to detect subgroups in global gene expression datasets and identify their distinguishing features. First, TTMap discerns and adjusts for highly variable features in the control group and identifies outliers. Second, the deviation of each test sample from the control group in a high-dimensional space is computed and the test samples are clustered in a global and local network using a new topological algorithm based on Mapper. Validation of TTMap on both synthetic and biological datasets shows that it outperforms current clustering methods in sensitivity and stability; clustering is not affected by removal of samples from the control group, choice of normalization nor subselection of data. There is no user induced bias because all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Topological and Geometric Data Analysis · Gene expression and cancer classification
