Efficient Non-Parametric Optimizer Search for Diverse Tasks
Ruochen Wang, Yuanhao Xiong, Minhao Cheng, Cho-Jui Hsieh

TL;DR
This paper introduces a scalable, efficient, and general framework for optimizer search that models optimizer updates as paths in a super-tree, enabling effective discovery of superior optimizers with minimal evaluations.
Contribution
The authors propose a novel tree-structured search space for optimizer design and an adapted Monte Carlo tree search method, improving scalability and sample efficiency in optimizer discovery.
Findings
Discovered optimizers outperform human-designed and prior search methods.
Achieved effective optimizer search with only 128 evaluations.
Framework applicable across diverse tasks and models.
Abstract
Efficient and automated design of optimizers plays a crucial role in full-stack AutoML systems. However, prior methods in optimizer search are often limited by their scalability, generability, or sample efficiency. With the goal of democratizing research and application of optimizer search, we present the first efficient, scalable and generalizable framework that can directly search on the tasks of interest. We first observe that optimizer updates are fundamentally mathematical expressions applied to the gradient. Inspired by the innate tree structure of the underlying math expressions, we re-arrange the space of optimizers into a super-tree, where each path encodes an optimizer. This way, optimizer search can be naturally formulated as a path-finding problem, allowing a variety of well-established tree traversal methods to be used as the search algorithm. We adopt an adaptation of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Software Engineering Research · Machine Learning and Algorithms
