Are Joins over LSM-Trees Ready? Take RocksDB as an Example
Weiping Yu, Fan Wang, Xuwei Zhang, Siqiang Luo

TL;DR
This paper systematically evaluates the performance of various join methods over LSM-trees, combining theoretical analysis and extensive benchmarking to guide developers in selecting optimal join strategies for LSM-based data stores.
Contribution
It provides the first comprehensive benchmark and analysis of join algorithms over LSM-trees, considering different configurations and implementation strategies.
Findings
Certain join methods outperform others depending on workload and index type.
Theoretical overhead analysis aligns with experimental performance results.
Guidelines are provided for choosing join methods in LSM-based systems.
Abstract
LSM-tree-based data stores are widely adopted in industries for their excellent performance. As data scales increase, disk-based join operations become indispensable yet costly for the database, making the selection of suitable join methods crucial for system optimization. Current LSM-based stores generally adhere to conventional relational database practices and support only a limited number of join methods. However, the LSM-tree delivers distinct read and write efficiency compared to the relational databases, which could accordingly impact the performance of various join methods. Therefore, it is necessary to reconsider the selection of join methods in this context to fully explore the potential of various join algorithms and index designs. In this work, we present a systematic study and an exhaustive benchmark for joins over LSM-trees. We define a configuration space for join…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Semantic Web and Ontologies · Data Mining Algorithms and Applications
