An optimization derivation of the method of conjugate gradients

David Ek; Anders Forsgren

arXiv:2011.02337·math.OC·April 2, 2021

An optimization derivation of the method of conjugate gradients

David Ek, Anders Forsgren

PDF

Open Access

TL;DR

This paper provides a new derivation of the conjugate gradients method by showing it minimizes a convex quadratic on the span of previous gradients, linking search directions to orthogonal gradients.

Contribution

It introduces a novel derivation of conjugate gradients based on convex quadratic minimization and orthogonal gradients, offering clearer insight into the method's geometric properties.

Findings

01

Conjugate gradients iterates minimize a convex quadratic on previous gradient span.

02

Search directions are orthogonal gradients scaled by a negative scalar.

03

The method relates to the Euclidean norm minimization on affine spans.

Abstract

We give a derivation of the method of conjugate gradients based on the requirement that each iterate minimizes a strictly convex quadratic on the space spanned by the previously observed gradients. Rather than verifying that the search direction has the correct properties, we show that generation of such iterates is equivalent to generation of orthogonal gradients which gives the description of the direction and the step length. Our approach gives a straightforward way to see that the search direction of the method of conjugate gradients is a negative scalar times the gradient of minimum Euclidean norm evaluated on the affine span of the iterates generated so far.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Optimization Algorithms Research · Field-Flow Fractionation Techniques · Mathematical Biology Tumor Growth