E2ETune: End-to-End Knob Tuning via Fine-tuned Generative Language Model
Xinmei Huang, Haoyang Li, Jing Zhang, Xinxin Zhao, Zhiming Yao, Yiyan, Li, Tieying Zhang, Jianjun Chen, Hong Chen, Cuiping Li

TL;DR
E2ETune leverages a fine-tuned generative language model to efficiently predict optimal database configurations for new workloads, reducing tuning time compared to traditional methods.
Contribution
The paper introduces a novel end-to-end knob tuning approach using a fine-tuned generative language model and a data generation framework for training.
Findings
E2ETune achieves faster configuration recommendations than state-of-the-art methods.
It maintains competitive performance across multiple benchmarks.
The approach reduces the need for workload replays and resource consumption.
Abstract
Database knob tuning is a significant challenge for database administrators, as it involves tuning a large number of configuration knobs with continuous or discrete values to achieve optimal database performance. Traditional methods, such as manual tuning or learning-based approaches, typically require numerous workload replays and are both time-consuming and resource-intensive. To address this challenge, we introduce E2ETune, an end-to-end knob tuner powered by a fine-tuned generative language model. The key idea is to leverage the exceptional sequence-to-sequence modeling capabilities of generative language models to capture the complex mapping between workloads (inputs) and their corresponding promising configurations (outputs). To achieve this goal, we propose a novel data generation framework to efficiently produce a large amount of training data, where each data sample consists of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Semantic Web and Ontologies · Privacy-Preserving Technologies in Data
MethodsBalanced Selection
