PAM: Parallel Augmented Maps
Yihan Sun, Daniel Ferizovic, Guy E. Blelloch

TL;DR
PAM introduces a parallel, concurrent library for augmented ordered maps supporting fast range queries and sums, simplifying implementation and achieving high performance in large-scale data processing applications.
Contribution
The paper presents PAM, a novel parallel library for augmented maps with a flexible interface and efficient algorithms, enabling easier and faster implementation of complex data structures.
Findings
Achieves 40-90x speedup on 72 cores with hyperthreading.
Matches or exceeds existing libraries in sequential performance.
Successfully applied to multiple data processing applications.
Abstract
Ordered (key-value) maps are an important and widely-used data type for large-scale data processing frameworks. Beyond simple search, insertion and deletion, more advanced operations such as range extraction, filtering, and bulk updates form a critical part of these frameworks. We describe an interface for ordered maps that is augmented to support fast range queries and sums, and introduce a parallel and concurrent library called PAM (Parallel Augmented Maps) that implements the interface. The interface includes a wide variety of functions on augmented maps ranging from basic insertion and deletion to more interesting functions such as union, intersection, filtering, extracting ranges, splitting, and range-sums. We describe algorithms for these functions that are efficient both in theory and practice. As examples of the use of the interface and the performance of PAM, we apply the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
