LSM-OPD: Boosting Scan in LSM-Trees by Enabling Direct Computing on Compressed Data
Jianfeng Huang, Ziyao Wang, Lin Yuan, Jiajie Wen, Yihao Cao, Dongjing Miao, Yong Wang, Jiahao Zhang

TL;DR
LSM-OPD introduces a novel encoding scheme that enables direct computation on compressed data in LSM-Trees, significantly improving scan performance and reducing I/O and computational bottlenecks on modern storage devices.
Contribution
The paper proposes LSM-OPD, a new encoding scheme that allows direct computation on compressed data, enhancing scan efficiency and offloading costly operations in LSM-Trees.
Findings
Significantly reduces I/O requests during scans.
Enables offloading of compaction and filtering to lightweight dictionaries.
Achieves superior performance across various storage devices.
Abstract
Scan-based operations, such as backstage compaction and value filtering, have emerged as the main bottleneck for LSM-Trees in supporting contemporary data-intensive applications. For slower external storage devices, such as HDD and SATA SSD, the scan performance is primarily limited by the I/O bandwidth (i.e., I/O bound) due to the substantial read/write amplifications in LSM-Trees. Recent adoption of high-performance storage devices, such as NVMe SSD, has transformed the main limitation to be compute-bound, emerging the impact of computational resource consumption caused by inefficient compactions and filtering. However, when the value size increases, the bottleneck for scan performance in fast devices gradually shifts towards the I/O bandwidth as well, and the overall throughput across all types of devices undergo a dramatic reduction. This paper addresses the core issues by proposing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
