FliX: Flipped-Indexing for Scalable GPU Queries and Updates
Rosina Kharal, Trevor Brown, Justus Henneberg, Felix Schuhknecht

TL;DR
FliX introduces a flipped indexing strategy for GPU data structures that significantly improves query latency and throughput by eliminating the index layer and simplifying updates.
Contribution
The paper proposes a novel flipped indexing paradigm for fully GPU-resident data structures, reducing work and contention during queries and updates.
Findings
FliX reduces query latency by up to 6.5x compared to GPU B-trees.
FliX achieves 4x higher throughput per memory footprint than ordered competitors.
FliX outperforms dynamic ordered baselines by over 8x in insertion throughput.
Abstract
GPU-based concurrent data structures (CDSs) achieve high throughput for read-only queries, but efficient support for dynamic updates on fully GPU-resident data remains challenging. Ordered CDSs (e.g., B-trees and LSM-trees) maintain an index layer that directs operations to a data layer (buckets or leaves), while hash tables avoid the cost of maintaining order but do not support range or successor queries. On GPUs, maintaining and traversing an index layer under frequent updates introduces contention and warp divergence. To tackle these problems, we flip the indexing paradigm on its head with FliX, a comparison-based, flipped indexing strategy for dynamic, fully GPU-resident CDSs. Traditional GPU CDSs typically take a batch of operations and assign each operation to a GPU thread or warp. FliX, however, assigns compute (e.g., a warp) to each bucket in the data layer, and each bucket…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
