A computationally efficient framework for vector representation of persistence diagrams
Kit C. Chan, Umar Islambekov, Alexey Luchinsky, Rebecca Sanders

TL;DR
This paper introduces a new, efficient method to convert persistence diagrams from Topological Data Analysis into vector form, enabling better integration with machine learning tasks.
Contribution
The authors propose the vectorized persistence block (VPB), a novel, stable, and computationally efficient representation of persistence diagrams for machine learning applications.
Findings
VPBs are stable to input noise.
VPBs are computationally efficient.
VPBs improve performance in clustering, classification, and change point detection.
Abstract
In Topological Data Analysis, a common way of quantifying the shape of data is to use a persistence diagram (PD). PDs are multisets of points in computed using tools of algebraic topology. However, this multi-set structure limits the utility of PDs in applications. Therefore, in recent years efforts have been directed towards extracting informative and efficient summaries from PDs to broaden the scope of their use for machine learning tasks. We propose a computationally efficient framework to convert a PD into a vector in , called a vectorized persistence block (VPB). We show that our representation possesses many of the desired properties of vector-based summaries such as stability with respect to input noise, low computational cost and flexibility. Through simulation studies, we demonstrate the effectiveness of VPBs in terms of performance and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Cell Image Analysis Techniques · Homotopy and Cohomology in Algebraic Topology
