Derivation of Coordinate Descent Algorithms from Optimal Control Theory

I. M. Ross

arXiv:2309.03990·math.OC·September 11, 2023

TL;DR

This paper shows that coordinate descent algorithms can be derived systematically from optimal control theory. Their convergence is connected to the controlled dissipation of Lyapunov functions, and the Hessian of the convex objective supplies the metric for the search vector.

Contribution

It derives basic coordinate descent algorithms from optimal control theory by applying a maximum principle with a collection of max functions serving as "control" Lyapunov functions.

Findings

01

Coordinate descent algorithms can be derived from optimal control theory.

02

Convergence is connected to the controlled dissipation of the corresponding Lyapunov functions.

03

The Hessian of the convex objective function gives the operational metric for the search vector.

Abstract

Recently, it was posited that disparate optimization algorithms may be coalesced in terms of a central source emanating from optimal control theory. Here we further this proposition by showing how coordinate descent algorithms may be derived from this emerging new principle. In particular, we show that basic coordinate descent algorithms can be derived using a maximum principle and a collection of max functions as "control" Lyapunov functions. The convergence of the resulting coordinate descent algorithms is thus connected to the controlled dissipation of their corresponding Lyapunov functions. The operational metric for the search vector in all cases is given by the Hessian of the convex objective function.
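
For readers unfamiliar with the algorithm class, the following is a minimal sketch of a basic coordinate descent iteration. It is not the paper's derivation. It assumes a convex quadratic objective f(x) = 0.5 x^T A x - b^T x with a symmetric positive definite Hessian A, picks the coordinate with the largest absolute gradient component (a greedy rule chosen here only because it involves a max function), and scales the step by the matching diagonal Hessian entry. The function name, test problem, and tolerances are all illustrative assumptions.

```python
import numpy as np


def greedy_coordinate_descent(A, b, x0, tol=1e-10, max_iter=10_000):
    """Minimize f(x) = 0.5 x^T A x - b^T x for symmetric positive definite A.

    Illustrative sketch only: the greedy coordinate rule and the quadratic
    objective are assumptions, not the construction used in the paper.
    """
    x = np.asarray(x0, dtype=float).copy()
    f_history = []
    for _ in range(max_iter):
        g = A @ x - b                      # gradient of the quadratic
        f_history.append(0.5 * x @ A @ x - b @ x)
        if np.max(np.abs(g)) < tol:        # max function of the gradient
            break
        i = int(np.argmax(np.abs(g)))      # coordinate with largest |g_i|
        x[i] -= g[i] / A[i, i]             # step scaled by Hessian diagonal
    return x, f_history


# Hypothetical 2x2 test problem.
A = np.array([[4.0, 1.0], [1.0, 3.0]])
b = np.array([1.0, 2.0])
x_star, f_history = greedy_coordinate_descent(A, b, np.zeros(2))
```

For this quadratic, each update minimizes f exactly along the chosen coordinate, so the recorded objective values never increase. That monotone decrease is a simple discrete analogue of the dissipation idea in the abstract, under the assumptions stated above.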
