Learning Algorithm Hyperparameters for Fast Parametric Convex Optimization
Rajiv Sambharya, Bartolomeo Stellato

TL;DR
This paper presents a machine-learning framework to optimize hyperparameters of first-order methods for parametric convex problems, improving convergence speed and guaranteeing performance with minimal training data.
Contribution
It introduces a flexible, convergent learned optimizer that adapts hyperparameters across iterations and provides generalization guarantees for unseen data.
Findings
Effective hyperparameter learning for gradient-based algorithms
Achieves convergence guarantees and generalization bounds
Requires only 10 problem instances for training
Abstract
We introduce a machine-learning framework to learn the hyperparameter sequence of first-order methods (e.g., the step sizes in gradient descent) to quickly solve parametric convex optimization problems. Our computational architecture amounts to running fixed-point iterations where the hyperparameters are the same across all parametric instances and consists of two phases. In the first step-varying phase the hyperparameters vary across iterations, while in the second steady-state phase the hyperparameters are constant across iterations. Our learned optimizer is flexible in that it can be evaluated on any number of iterations and is guaranteed to converge to an optimal solution. To train, we minimize the mean square error to a ground truth solution. In the case of gradient descent, the one-step optimal step size is the solution to a least squares problem, and in the case of unconstrained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Metaheuristic Optimization Algorithms Research · Machine Learning and Data Classification
