Characterize LSM-tree Compaction Performance via On-Device LLM Inference
Jiabiao Ding, Yina Lv, Qiao Li, Zhirong Shen, Chun Jason Xue

TL;DR
This paper explores using on-device small-scale LLMs for real-time tuning of LSM-tree compaction parameters in key-value storage engines, balancing inference latency and reasoning accuracy to optimize performance.
Contribution
It evaluates the trade-off between model size, inference latency, and tuning effectiveness, and characterizes LSM-tree performance with parameter adjustments using RocksDB.
Findings
Positive correlation between model capability and tuning effectiveness
Small-scale LLMs can achieve effective real-time compaction tuning
Analysis of inference latency across different LLM scales
Abstract
Modern key-value storage engines built on Log-Structured Merge-trees (LSM-trees), such as RocksDB and LevelDB, rely heavily on the performance of their compaction operations, which are impacted by a complex set of interdependent configuration parameters. Manually tuning these parameters for optimal performance demands considerable expertise, while traditional auto-tuning approaches struggle with the enormous search space and low sample efficiency inherent to this domain. In recent years, Large Language Models (LLMs) have demonstrated strong capabilities in code generation and logical reasoning, offering new possibilities for system optimization. However, applying LLMs to real-time compaction tuning in such latency-sensitive environments is a double-edged sword. While large-scale LLMs can offer superior reasoning for strategy generation, their high inference latency and computational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Parallel Computing and Optimization Techniques · Advanced Data Storage Technologies
