Efficient Non-Parametric Optimizer Search for Diverse Tasks

Ruochen Wang; Yuanhao Xiong; Minhao Cheng; Cho-Jui Hsieh

arXiv:2209.13575·cs.LG·September 29, 2022

Efficient Non-Parametric Optimizer Search for Diverse Tasks

Ruochen Wang, Yuanhao Xiong, Minhao Cheng, Cho-Jui Hsieh

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a scalable, efficient, and general framework for optimizer search that models optimizer updates as paths in a super-tree, enabling effective discovery of superior optimizers with minimal evaluations.

Contribution

The authors propose a novel tree-structured search space for optimizer design and an adapted Monte Carlo tree search method, improving scalability and sample efficiency in optimizer discovery.

Findings

01

Discovered optimizers outperform human-designed and prior search methods.

02

Achieved effective optimizer search with only 128 evaluations.

03

Framework applicable across diverse tasks and models.

Abstract

Efficient and automated design of optimizers plays a crucial role in full-stack AutoML systems. However, prior methods in optimizer search are often limited by their scalability, generability, or sample efficiency. With the goal of democratizing research and application of optimizer search, we present the first efficient, scalable and generalizable framework that can directly search on the tasks of interest. We first observe that optimizer updates are fundamentally mathematical expressions applied to the gradient. Inspired by the innate tree structure of the underlying math expressions, we re-arrange the space of optimizers into a super-tree, where each path encodes an optimizer. This way, optimizer search can be naturally formulated as a path-finding problem, allowing a variety of well-established tree traversal methods to be used as the search algorithm. We adopt an adaptation of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ruocwang/efficient-optimizer-search
pytorchOfficial

Videos

Efficient Non-Parametric Optimizer Search for Diverse Tasks· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Software Engineering Research · Machine Learning and Algorithms