Towards a General Framework for ML-based Self-tuning Databases

Thomas Schmied; Diego Didona; Andreas D\"oring; Thomas Parnell; and; Nikolas Ioannou

arXiv:2011.07921·cs.DB·April 28, 2021

Towards a General Framework for ML-based Self-tuning Databases

Thomas Schmied, Diego Didona, Andreas D\"oring, Thomas Parnell, and, Nikolas Ioannou

PDF

TL;DR

This paper explores applying ML-based self-tuning methods to FoundationDB, highlighting challenges like invalid configurations, and finds that simple random search performs competitively compared to complex ML techniques.

Contribution

It identifies key challenges in applying ML to database tuning and provides empirical comparison showing random search's effectiveness alongside Bayesian optimization and reinforcement learning.

Findings

01

ML methods improved FoundationDB throughput by up to 38%

02

Random search was only 4% worse than complex ML methods

03

Challenges like unknown parameter ranges are crucial for ML adoption in databases

Abstract

Machine learning (ML) methods have recently emerged as an effective way to perform automated parameter tuning of databases. State-of-the-art approaches include Bayesian optimization (BO) and reinforcement learning (RL). In this work, we describe our experience when applying these methods to a database not yet studied in this context: FoundationDB. Firstly, we describe the challenges we faced, such as unknown valid ranges of configuration parameters and combinations of parameter values that result in invalid runs, and how we mitigated them. While these issues are typically overlooked, we argue that they are a crucial barrier to the adoption of ML self-tuning techniques in databases, and thus deserve more attention from the research community. Secondly, we present experimental results obtained when tuning FoundationDB using ML methods. Unlike prior work in this domain, we also compare…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRandom Search