Vectorization of Persistence Diagrams for Topological Data Analysis in R and Python Using TDAvec Package
Aleksei Luchinsky, Umar Islambekov

TL;DR
This paper introduces TDAvec, a software package that simplifies the vectorization of persistence diagrams, enabling their effective use in machine learning applications within topological data analysis.
Contribution
The paper presents a new R and Python package that streamlines the vectorization of persistence diagrams, enhancing their integration into machine learning workflows.
Findings
Provides an intuitive workflow for vectorizing persistence diagrams
Includes advanced functionalities for topological data analysis
Demonstrates practical applications with real data examples
Abstract
Persistent homology is a widely-used tool in topological data analysis (TDA) for understanding the underlying shape of complex data. By constructing a filtration of simplicial complexes from data points, it captures topological features such as connected components, loops, and voids across multiple scales. These features are encoded in persistence diagrams (PDs), which provide a concise summary of the data's topological structure. However, the non-Hilbert nature of the space of PDs poses challenges for their direct use in machine learning applications. To address this, kernel methods and vectorization techniques have been developed to transform PDs into machine-learning-compatible formats. In this paper, we introduce a new software package designed to streamline the vectorization of PDs, offering an intuitive workflow and advanced functionalities. We demonstrate the necessity of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Bioinformatics and Genomic Networks · Metabolomics and Mass Spectrometry Studies
