In Search of the Fastest Concurrent Union-Find Algorithm
Dan Alistarh, Alexander Fedorov, and Nikita Koval

TL;DR
This paper evaluates the practical performance of various concurrent Union-Find algorithms across multiple platforms, introduces new cache-efficient techniques, and explores transactional memory to improve scalability and efficiency.
Contribution
It provides a comprehensive performance analysis of concurrent Union-Find algorithms, introduces novel cache-reduction techniques, and demonstrates the effectiveness of transactional memory in this context.
Findings
Cache misses are the main performance bottleneck.
New techniques reduce cache-related costs effectively.
Transactional Memory can significantly improve scalability.
Abstract
Union-Find (or Disjoint-Set Union) is one of the fundamental problems in computer science; it has been well-studied from both theoretical and practical perspectives in the sequential case. Recently, there has been mounting interest in analyzing this problem in the concurrent scenario, and several asymptotically-efficient algorithms have been proposed. Yet, to date, there is very little known about the practical performance of concurrent Union-Find. This work addresses this gap. We evaluate and analyze the performance of several concurrent Union-Find algorithms and optimization strategies across a wide range of platforms (Intel, AMD, and ARM) and workloads (social, random, and road networks, as well as integrations into more complex algorithms). We first observe that, due to the limited computational cost, the number of induced cache misses is the critical determining factor for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
