Feasible-First Exploration for Constrained ML Deployment Optimization in Crash-Prone Hierarchical Search Spaces
Christian Lysenst{\o}en

TL;DR
This paper introduces a feasible-first exploration method called TBA for optimizing constrained machine learning deployment, effectively navigating hostile hierarchical search spaces to improve model discovery and reduce wasted evaluations.
Contribution
It proposes Thermal Budget Annealing (TBA), a novel exploration strategy that enhances black-box optimization in constrained deployment scenarios by mapping feasible regions before exploitation.
Findings
TBA improves model discovery under tight constraints.
The method reduces wasted evaluations compared to cold-start TPE.
Experiments on synthetic and real GPU benchmarks validate effectiveness.
Abstract
Deploying machine learning models under production constraints requires joint optimization over model family, quantization scheme, runtime backend, and serving configuration. This induces a hierarchical mixed-variable search space in which many configurations are invalid: evaluations may crash, exceed memory limits, or violate latency constraints. Standard black-box optimizers such as Tree-structured Parzen Estimators (TPE) and constrained Bayesian optimization are effective when valid configurations are common, but they can spend a large fraction of a small evaluation budget on invalid or uninformative trials in hostile deployment spaces. This paper studies that regime and asks whether optimization should be decomposed into an explicit exploration stage followed by model-guided exploitation. We propose Thermal Budget Annealing (TBA), a feasible-first exploration procedure that maps…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
