Learning Algorithm Hyperparameters for Fast Parametric Convex   Optimization

Rajiv Sambharya; Bartolomeo Stellato

arXiv:2411.15717·math.OC·December 23, 2024

Learning Algorithm Hyperparameters for Fast Parametric Convex Optimization

Rajiv Sambharya, Bartolomeo Stellato

PDF

Open Access

TL;DR

This paper presents a machine-learning framework to optimize hyperparameters of first-order methods for parametric convex problems, improving convergence speed and guaranteeing performance with minimal training data.

Contribution

It introduces a flexible, convergent learned optimizer that adapts hyperparameters across iterations and provides generalization guarantees for unseen data.

Findings

01

Effective hyperparameter learning for gradient-based algorithms

02

Achieves convergence guarantees and generalization bounds

03

Requires only 10 problem instances for training

Abstract

We introduce a machine-learning framework to learn the hyperparameter sequence of first-order methods (e.g., the step sizes in gradient descent) to quickly solve parametric convex optimization problems. Our computational architecture amounts to running fixed-point iterations where the hyperparameters are the same across all parametric instances and consists of two phases. In the first step-varying phase the hyperparameters vary across iterations, while in the second steady-state phase the hyperparameters are constant across iterations. Our learned optimizer is flexible in that it can be evaluated on any number of iterations and is guaranteed to converge to an optimal solution. To train, we minimize the mean square error to a ground truth solution. In the case of gradient descent, the one-step optimal step size is the solution to a least squares problem, and in the case of unconstrained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition · Metaheuristic Optimization Algorithms Research · Machine Learning and Data Classification