Polynomial Preconditioning for Gradient Methods

Nikita Doikov; Anton Rodomanov

arXiv:2301.13194·math.OC·January 31, 2023

Polynomial Preconditioning for Gradient Methods

Nikita Doikov, Anton Rodomanov

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel polynomial preconditioning technique for gradient methods that improves convergence by better conditioning, applicable without explicit spectral knowledge, and demonstrates efficiency in machine learning tasks.

Contribution

It proposes a new family of polynomial preconditioners based on symmetric polynomials, enhancing gradient methods with provable condition number improvements and adaptive selection strategies.

Findings

01

Improved convergence rates for gradient methods using polynomial preconditioning.

02

Effective automatic selection of optimal polynomial preconditioners.

03

Numerical experiments show significant efficiency gains in machine learning applications.

Abstract

We study first-order methods with preconditioning for solving structured nonlinear convex optimization problems. We propose a new family of preconditioners generated by symmetric polynomials. They provide first-order optimization methods with a provable improvement of the condition number, cutting the gaps between highest eigenvalues, without explicit knowledge of the actual spectrum. We give a stochastic interpretation of this preconditioning in terms of coordinate volume sampling and compare it with other classical approaches, including the Chebyshev polynomials. We show how to incorporate a polynomial preconditioning into the Gradient and Fast Gradient Methods and establish the corresponding global complexity bounds. Finally, we propose a simple adaptive search procedure that automatically chooses the best possible polynomial preconditioning for the Gradient Method, minimizing the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Polynomial Preconditioning for Gradient Methods· slideslive

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Matrix Theory and Algorithms · Stochastic Gradient Optimization Techniques