AisLSM: Revolutionizing the Compaction with Asynchronous I/Os for LSM-tree
Yanpeng Hu, Li Zhu, Lei Jia, Chundong Wang

TL;DR
AisLSM introduces an asynchronous I/O approach to LSM-tree compaction, significantly improving performance and reducing latency by overlapping disk I/O with computation while ensuring data durability.
Contribution
The paper proposes AisLSM, a novel LSM-tree variant that employs asynchronous I/O for compaction, enhancing throughput and latency without compromising data safety.
Findings
Up to 2.14x performance boost over RocksDB
Significantly higher throughput than existing LSM variants
Lower tail latency in compaction processes
Abstract
The log-structured merge tree (LSM-tree) is widely employed to build key-value (KV) stores. LSM-tree organizes multiple levels in memory and on disk. The compaction of LSM-tree, which is used to redeploy KV pairs between on-disk levels in the form of SST files, severely stalls its foreground service. We overhaul and analyze the procedure of compaction. Writing and persisting files with fsyncs for compacted KV pairs are time-consuming and, more important, occur synchronously on the critical path of compaction. The user-space compaction thread of LSM-tree stays waiting for completion signals from a kernel-space thread that is processing file write and fsync I/Os. We accordingly design a new LSM-tree variant named AisLSM with an asynchronous I/O model. In short, AisLSM conducts asynchronous writes and fsyncs for SST files generated in a compaction and overlaps CPU computations with disk…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Parallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems
