Analysis of Schedule-Free Nonconvex Optimization
Connor Brown

TL;DR
This paper develops a Lyapunov-based analysis for schedule-free first-order methods in nonconvex optimization, providing horizon-independent convergence guarantees and validating them through numerical experiments.
Contribution
It introduces a robust Lyapunov framework that extends schedule-free methods' convergence guarantees to nonconvex settings with minimal assumptions.
Findings
Horizon-agnostic convergence bounds for nonconvex SF methods.
Validation of theoretical rates through PEP experiments.
Potential for tighter bounds on the original SF algorithm.
Abstract
First-order methods underpin most large-scale learning algorithms, yet their classical convergence guarantees hinge on carefully scheduled step-sizes that depend on the total horizon , which is rarely known in advance. The Schedule-Free (SF) method promises optimal performance with hyperparameters that are independent of by interpolating between Polyak--Ruppert averaging and momentum, but nonconvex analysis of SF has been limited or reliant on strong global assumptions. We introduce a robust Lyapunov framework that, under only -smoothness and lower-boundedness, reduces SF analysis to a single-step descent inequality. This yields horizon-agnostic bounds in the nonconvex setting: for constant step + PR averaging, for a linearly growing step-size, and a continuum of rates for polynomial averaging. We complement these proofs with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research
