Towards a General Framework for ML-based Self-tuning Databases
Thomas Schmied, Diego Didona, Andreas D\"oring, Thomas Parnell, and, Nikolas Ioannou

TL;DR
This paper explores applying ML-based self-tuning methods to FoundationDB, highlighting challenges like invalid configurations, and finds that simple random search performs competitively compared to complex ML techniques.
Contribution
It identifies key challenges in applying ML to database tuning and provides empirical comparison showing random search's effectiveness alongside Bayesian optimization and reinforcement learning.
Findings
ML methods improved FoundationDB throughput by up to 38%
Random search was only 4% worse than complex ML methods
Challenges like unknown parameter ranges are crucial for ML adoption in databases
Abstract
Machine learning (ML) methods have recently emerged as an effective way to perform automated parameter tuning of databases. State-of-the-art approaches include Bayesian optimization (BO) and reinforcement learning (RL). In this work, we describe our experience when applying these methods to a database not yet studied in this context: FoundationDB. Firstly, we describe the challenges we faced, such as unknown valid ranges of configuration parameters and combinations of parameter values that result in invalid runs, and how we mitigated them. While these issues are typically overlooked, we argue that they are a crucial barrier to the adoption of ML self-tuning techniques in databases, and thus deserve more attention from the research community. Secondly, we present experimental results obtained when tuning FoundationDB using ML methods. Unlike prior work in this domain, we also compare…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRandom Search
