A Generalization Result for Convergence in Learning-to-Optimize
Michael Sucker, Peter Ochs

TL;DR
This paper introduces a probabilistic framework for analyzing convergence in learning-to-optimize, providing theoretical guarantees that learned algorithms converge to critical points with high probability, thus bridging empirical success and theoretical understanding.
Contribution
It develops a novel probabilistic approach that extends classical geometric convergence arguments to learned optimization algorithms, enabling theoretical guarantees for non-convex, non-smooth loss functions.
Findings
Establishes high-probability convergence of learned algorithms to critical points.
Generalizes worst-case analysis into a probabilistic framework.
Removes the need for safeguards in algorithm design.
Abstract
Learning-to-optimize leverages machine learning to accelerate optimization algorithms. While empirical results show tremendous improvements compared to classical optimization algorithms, theoretical guarantees are mostly lacking, such that the outcome cannot be reliably assured. Especially, convergence is hardly studied in learning-to-optimize, because conventional convergence guarantees in optimization are based on geometric arguments, which cannot be applied easily to learned algorithms. Thus, we develop a probabilistic framework that resembles classical optimization and allows for transferring geometric arguments into learning-to-optimize. Based on our new proof-strategy, our main theorem is a generalization result for parametric classes of potentially non-smooth, non-convex loss functions and establishes the convergence of learned optimization algorithms to critical points with high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMetaheuristic Optimization Algorithms Research
