TL;DR
This paper introduces a scalable, parallel algorithm for depression filling in digital elevation models that efficiently handles trillion-cell datasets on desktops or clusters, significantly improving speed and resource use.
Contribution
A novel, linearly-scaling parallel algorithm for depression filling in DEMs that guarantees fixed memory access and communication, enabling processing of trillion-cell datasets.
Findings
Achieves ~60% scaling efficiency up to 48 cores
Runs linearly across datasets over three orders of magnitude
Processes 2 trillion-cell DEM in 4.8 hours with 48 cores
Abstract
Algorithms for extracting hydrologic features and properties from digital elevation models (DEMs) are challenged by large datasets, which often cannot fit within a computer's RAM. Depression filling is an important preconditioning step to many of these algorithms. Here, I present a new, linearly-scaling algorithm which parallelizes the Priority-Flood depression-filling algorithm by subdividing a DEM into tiles. Using a single-producer, multi-consumer design, the new algorithm works equally well on one core, multiple cores, or multiple machines and can take advantage of large memories or cope with small ones. Unlike previous algorithms, the new algorithm guarantees a fixed number of memory access and communication events per subdivision of the DEM. In comparison testing, this results in the new algorithm running generally faster while using fewer resources than previous algorithms. For…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
