Extremely Scalable Distributed Computation of Contour Trees via Pre-Simplification
Mingzhe Li, Hamish Carr, Oliver R\"ubel, Bei Wang, Gunther H. Weber

TL;DR
This paper introduces a pre-simplification method that reduces memory usage in distributed contour tree computations, enabling the processing of extremely large datasets efficiently and scaling to over half a trillion nodes.
Contribution
It presents a novel pre-simplification strategy that significantly improves scalability and memory efficiency in distributed contour tree analysis for large scientific datasets.
Findings
Constructed the largest known contour tree with over 500 billion nodes.
Achieved contour tree computation on 550-billion-element dataset in under 15 minutes.
Demonstrated improved scalability with strong scaling experiments.
Abstract
Contour trees offer an abstract representation of the level set topology in scalar fields and are widely used in topological data analysis and visualization. However, applying contour trees to large-scale scientific datasets remains challenging due to scalability limitations. Recent developments in distributed hierarchical contour trees have addressed these challenges by enabling scalable computation across distributed systems. Building on these structures, advanced analytical tasks -- such as volumetric branch decomposition and contour extraction -- have been introduced to facilitate large-scale scientific analysis. Despite these advancements, such analytical tasks substantially increase memory usage, which hampers scalability. In this paper, we propose a pre-simplification strategy to significantly reduce the memory overhead associated with analytical tasks on distributed hierarchical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Computer Graphics and Visualization Techniques · Data Visualization and Analytics
