CompassDB: Pioneering High-Performance Key-Value Store with Perfect Hash
Jin Jiang, Dongsheng He, Yu Hu, Dong Liu, Chenfan Xiao, Hongxiao Bi,, Yusong Zhang, Chaoqu Jiang, Zhijun Fu

TL;DR
CompassDB introduces a high-performance key-value store using a perfect hash-based index to significantly reduce read/write amplification, improving throughput and latency over existing LSM-tree based engines especially on SSDs.
Contribution
It presents a novel Two-tier Perfect Hash Table design that drastically decreases index size and lookup time, outperforming traditional LSM-tree based storage engines.
Findings
Increases throughput by 2.5x to 4x over RocksDB.
Reduces latency by 50% to 85% compared to RocksDB.
Achieves 5x to 17x throughput improvement over PebblesDB.
Abstract
Modern mainstream persistent key-value storage engines utilize Log-Structured Merge tree (LSM-tree) based designs, optimizing read/write performance by leveraging sequential disk I/O. However, the advent of SSDs, with their significant improvements in bandwidth and IOPS, shifts the bottleneck from I/O to CPU. The high compaction cost and large read/write amplification associated with LSM trees have become critical bottlenecks. In this paper, we introduce CompassDB, which utilizes a Two-tier Perfect Hash Table (TPH) design to significantly decrease read/write amplification and compaction costs. CompassDB utilizes a perfect hash algorithm for its in-memory index, resulting in an average index cost of about 6 bytes per key-value pair. This compact index reduces the lookup time complexity from to and decreases the overall cost. Consequently, it allows for the storage of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
