Distributed Discrete Morse Sandwich: Efficient Computation of Persistence Diagrams for Massive Scalar Data
Eve Le Guillou, Pierre Fortin, Julien Tierny

TL;DR
This paper extends the Discrete Morse Sandwich method to distributed-memory systems, enabling scalable and efficient computation of persistence diagrams for massive 3D scalar datasets across multiple compute nodes.
Contribution
We developed a distributed-memory parallel version of DMS with new algorithms and data structures, significantly improving scalability and performance for large datasets.
Findings
Achieved up to 8x speedup over DIPHA on 512 cores
Successfully computed a 6-billion-vertex dataset in 174 seconds
Demonstrated scalability up to 16 nodes with 512 cores
Abstract
The persistence diagram, which describes the topological features of a dataset, is a key descriptor in Topological Data Analysis. The "Discrete Morse Sandwich" (DMS) method has been reported to be the most efficient algorithm for computing persistence diagrams of 3D scalar fields on a single node, using shared-memory parallelism. In this work, we extend DMS to distributed-memory parallelism for the efficient and scalable computation of persistence diagrams for massive datasets across multiple compute nodes. On the one hand, we can leverage the embarrassingly parallel procedure of the first and most time-consuming step of DMS (namely the discrete gradient computation). On the other hand, the efficient distributed computations of the subsequent DMS steps are much more challenging. To address this, we have extensively revised the DMS routines by contributing a new self-correcting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Cell Image Analysis Techniques · Bioinformatics and Genomic Networks
