On the Finite Time Convergence of Cyclic Coordinate Descent Methods
Ankan Saha, Ambuj Tewari

TL;DR
This paper establishes finite-time convergence rates for cyclic coordinate descent methods in smooth optimization problems, demonstrating their competitive performance and stability compared to gradient descent.
Contribution
It provides the first $O(1/k)$ convergence rate analysis for cyclic coordinate descent under an isotonicity assumption, filling a key gap in the literature.
Findings
Cyclic coordinate descent achieves $O(1/k)$ convergence rates.
Iterates of cyclic coordinate descent outperform gradient descent.
The analysis compares two variants of cyclic coordinate descent.
Abstract
Cyclic coordinate descent is a classic optimization method that has witnessed a resurgence of interest in machine learning. Reasons for this include its simplicity, speed and stability, as well as its competitive performance on regularized smooth optimization problems. Surprisingly, very little is known about its finite time convergence behavior on these problems. Most existing results either just prove convergence or provide asymptotic rates. We fill this gap in the literature by proving convergence rates (where is the iteration counter) for two variants of cyclic coordinate descent under an isotonicity assumption. Our analysis proceeds by comparing the objective values attained by the two variants with each other, as well as with the gradient descent algorithm. We show that the iterates generated by the cyclic coordinate descent methods remain better than those…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
