Centrum: Model-based Database Auto-tuning with Minimal Distributional Assumptions
Yuanhao Lai, Pengfei Zheng, Chenpeng Ji, Yan Li, Songhan Zhang, Rutao Zhang, Zhengang Wang, Yunfei Du

TL;DR
Centrum introduces a novel, distribution-free model-based auto-tuning framework for DBMSs that leverages gradient boosting and conformal prediction to improve tuning efficiency and effectiveness.
Contribution
It is the first to combine gradient boosting ensembles with conformal inference for distribution-free Bayesian optimization in DBMS auto-tuning.
Findings
Centrum outperforms 21 state-of-the-art methods in experiments.
It achieves improved tuning efficiency and effectiveness.
The method is validated on multiple DBMSs and workloads.
Abstract
Gaussian-Process-based Bayesian optimization (GP-BO), is a prevailing model-based framework for DBMS auto-tuning. However, recent work shows GP-BO-based DBMS auto-tuners significantly outperformed auto-tuners based on SMAC, which features random forest surrogate models; such results motivate us to rethink and investigate the limitations of GP-BO in auto-tuner design. We find the fundamental assumptions of GP-BO are widely violated when modeling and optimizing DBMS performance, while tree-ensemble-BOs (e.g., SMAC) can avoid the assumption pitfalls and deliver improved tuning efficiency and effectiveness. Moreover, we argue that existing tree-ensemble-BOs restrict further advancement in DBMS auto-tuning. First, existing tree-ensemble-BOs can only achieve distribution-free point estimates, but still impose unrealistic distributional assumptions on uncertainty estimates, compromising…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
