Nonlinear conjugate gradient for smooth convex functions
Sahar Karimi, Stephen Vavasis

TL;DR
This paper introduces C+AG, a hybrid nonlinear conjugate gradient method that combines conjugate gradient and accelerated gradient steps, achieving optimal complexity bounds for smooth convex functions and quadratic functions.
Contribution
The paper proposes C+AG, a novel NCG method that is optimal for quadratic functions and maintains the best complexity bounds for general smooth convex functions.
Findings
C+AG is identical to conjugate gradient on quadratic functions.
C+AG achieves $O(rac{1}{ oot 2 olinebreak ext{eps}})$ complexity for smooth convex functions.
C+AG often outperforms classical NCG and accelerated gradient in computational tests.
Abstract
The method of nonlinear conjugate gradients (NCG) is widely used in practice for unconstrained optimization, but it satisfies weak complexity bounds at best when applied to smooth convex functions. In contrast, Nesterov's accelerated gradient (AG) method is optimal up to constant factors for this class. However, when specialized to quadratic function, conjugate gradient is optimal in a strong sense among function-gradient methods. Therefore, there is seemingly a gap in the menu of available algorithms: NCG, the optimal algorithm for quadratic functions that also exhibits good practical performance for general functions, has poor complexity bounds compared to AG. We propose an NCG method called C+AG ("conjugate plus accelerated gradient") to close this gap, that is, it is optimal for quadratic functions and still satisfies the best possible complexity bound for more general smooth convex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Intraoperative Neuromonitoring and Anesthetic Effects
