Curse of Dimensionality on Persistence Diagrams
Yasuaki Hiraoka (2, 3), Yusuke Imoto (2), Shu Kanazawa (3), Enhao Liu (1) ((1) Department of Mathematics, Kyoto University, (2) Institute for the Advanced Study of Human Biology, Kyoto University, (3) Kyoto University Institute for Advanced Study, Kyoto University)

TL;DR
This paper investigates how the reliability of persistence diagrams deteriorates in high-dimensional, low-sample-size data due to the curse of dimensionality, and explores PCA-based dimensionality reduction as a mitigation strategy.
Contribution
It demonstrates the loss of reliability of persistence diagrams in high-dimensional noise settings and proposes PCA for dimensionality reduction to address this issue.
Findings
Persistence diagrams become unreliable in high-dimensional noise.
PCA can mitigate the curse of dimensionality on persistence diagrams.
The study provides insights into processing high-dimensional low-sample-size data.
Abstract
The stability of persistent homology has led to wide applications of the persistence diagram as a trusted topological descriptor in the presence of noise. However, with the increasing demand for high-dimension and low-sample-size data processing in modern science, it is questionable whether persistence diagrams retain their reliability in the presence of high-dimensional noise. This work aims to study the reliability of persistence diagrams in the high-dimension low-sample-size data setting. By analyzing the asymptotic behavior of persistence diagrams for high-dimensional random data, we show that persistence diagrams are no longer reliable descriptors of low-sample-size data under high-dimensional noise perturbations. We refer to this loss of reliability of persistence diagrams in such data settings as the curse of dimensionality on persistence diagrams. Next, we investigate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Data Visualization and Analytics
