Linear Run Time of Persistent Homology Computation with GPU Parallelization
Michael G. Rawson

TL;DR
This paper presents a GPU-based parallelization approach that significantly reduces the computational complexity of persistent homology from $O(N^4)$ to potentially linear time, enabling analysis of larger datasets.
Contribution
The paper introduces a novel GPU parallelization method for persistent homology that reduces its computational complexity from $O(N^4)$ to $O(N)$, improving scalability.
Findings
GPU parallelization reduces run time from $O(N^4)$ to $O(N^2)$ and potentially to $O(N)$.
OpenMP parallelization yields a 1.75x speedup on dual-core machines.
Performance decreases with more threads due to overhead.
Abstract
Persistent homology is a crucial invariant that is used in many areas to understand data. The run time is a hindrance to its use on most large datasets. We give a parallelization method to utilize multi-core machines and clusters. We implement the computation of the persistent homology with OpenMP parallelization and observe a 1.75 fold performance increase by using 2 threads on a dual core machine. We also benchmark the computation using larger numbers of threads and show that the thread computational overhead decreases performance. With GPU parallelization, we analytically and empirically decrease the run time scaling from to and even where is the number of data points, for a large enough GPU. Next, we analytically show run time scaling for an even larger GPU.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis
