TL;DR
This extensive empirical study reveals that higher surrogate model accuracy does not always improve configuration tuning outcomes and can sometimes even worsen them, challenging the common belief that accuracy directly correlates with tuning quality.
Contribution
The paper provides the largest empirical analysis to date on the impact of surrogate model accuracy in configuration tuning, showing that accuracy is not always indicative of better tuning results and highlighting the need to reconsider current practices.
Findings
Higher accuracy often does not improve tuning outcomes (up to 58% cases)
In some cases, increased accuracy can degrade tuning quality (up to 24% cases)
Most proposed tuners use sub-optimal models and accuracy improvements vary with accuracy range
Abstract
To ease the expensive measurements during configuration tuning, it is natural to build a surrogate model as the replacement of the system, and thereby the configuration performance can be cheaply evaluated. Yet, a stereotype therein is that the higher the model accuracy, the better the tuning result would be. This "accuracy is all" belief drives our research community to build more and more accurate models and criticize a tuner for the inaccuracy of the model used. However, this practice raises some previously unaddressed questions, e.g., Do those somewhat small accuracy improvements reported in existing work really matter much to the tuners? What role does model accuracy play in the impact of tuning quality? To answer those related questions, we conduct one of the largest-scale empirical studies to date-running over the period of 13 months 24*7-that covers 10 models, 17 tuners, and 29…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
