On the Convergence Time of Dual Subgradient Methods for Strongly Convex Programs
Hao Yu, Michael J. Neely

TL;DR
This paper analyzes the convergence times of dual gradient methods for strongly convex programs, showing improved rates over general convex programs and introducing sliding averages for faster convergence under certain conditions.
Contribution
It establishes faster convergence bounds for dual gradient methods in strongly convex programs and introduces sliding running averages for even quicker convergence when the dual function is locally quadratic.
Findings
Convergence time for strongly convex programs is $O(1/\epsilon)$.
Sliding running averages achieve $O(\log(1/\epsilon))$ convergence under local quadraticity.
Numerical experiments confirm theoretical convergence rates.
Abstract
This paper studies the convergence time of dual gradient methods for general (possibly non-differentiable) strongly convex programs. For general convex programs, the convergence time of dual subgradient/gradient methods with simple running averages (running averages started from iteration ) is known to be . This paper shows that the convergence time for general strongly convex programs is . This paper also considers a variation of the average scheme, called the sliding running averages, and shows that if the dual function of the strongly convex program is locally quadratic (Note that the locally quadratic property is implied by the locally strongly concave property.) then the convergence time of the dual gradient method with sliding running averages is . The convergence time analysis is further verified by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques
