Persistent Homology for High-dimensional Data Based on Spectral Methods
Sebastian Damrich, Philipp Berens, Dmitry Kobak

TL;DR
This paper introduces spectral distance methods like diffusion distance and effective resistance to improve persistent homology analysis of high-dimensional data, demonstrating robustness against noise and successful application to single-cell RNA-sequencing data.
Contribution
It presents a novel closed-form formula for effective resistance and shows how spectral distances enhance topological data analysis in high-dimensional noisy settings.
Findings
Spectral distances detect correct topology in noisy high-dimensional data.
Effective resistance formula derived and related to diffusion distances.
Application to single-cell RNA-sequencing data reveals cell cycle loops.
Abstract
Persistent homology is a popular computational tool for analyzing the topology of point clouds, such as the presence of loops or voids. However, many real-world datasets with low intrinsic dimensionality reside in an ambient space of much higher dimensionality. We show that in this case traditional persistent homology becomes very sensitive to noise and fails to detect the correct topology. The same holds true for existing refinements of persistent homology. As a remedy, we find that spectral distances on the k-nearest-neighbor graph of the data, such as diffusion distance and effective resistance, allow to detect the correct topology even in the presence of high-dimensional noise. Moreover, we derive a novel closed-form formula for effective resistance, and describe its relation to diffusion distances. Finally, we apply these methods to high-dimensional single-cell RNA-sequencing data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopological and Geometric Data Analysis · Neuroinflammation and Neurodegeneration Mechanisms · Single-cell and spatial transcriptomics
MethodsDiffusion
