Higher-Order Corrections to Optimisers based on Newton's Method

S. J. Brooks

arXiv:2307.03820·math.NA·August 1, 2025

Higher-Order Corrections to Optimisers based on Newton's Method

S. J. Brooks

PDF

Open Access

TL;DR

This paper introduces higher-order correction techniques based on geodesic acceleration to improve Newton-like optimization methods, significantly reducing steps and computation time in nonlinear least squares problems.

Contribution

It develops a differential equation framework for higher-order corrections, enabling 2nd to 4th order improvements to Levenberg--Marquardt and similar algorithms.

Findings

01

Higher-order methods reduce optimization steps.

02

Significant decrease in computation time.

03

Effective for ill-conditioned Jacobian problems.

Abstract

The Newton, Gauss--Newton and Levenberg--Marquardt methods all use the first derivative of a vector function (the Jacobian) to minimise its sum of squares. When the Jacobian matrix is ill-conditioned, the function varies much faster in some directions than others and the space of possible improvement in sum of squares becomes a long narrow ellipsoid in the linear model. This means that even a small amount of nonlinearity in the problem parameters can cause a proposed point far down the long axis of the ellipsoid to fall outside of the actual curved valley of improved values, even though it is quite nearby. This paper presents a differential equation that `follows' these valleys, based on the technique of geodesic acceleration, which itself provides a 2 $^{nd}$ order improvement to the Levenberg--Marquardt iteration step. Higher derivatives of this equation are computed that allow…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Optimization Algorithms Research · Iterative Methods for Nonlinear Equations · Robotic Mechanisms and Dynamics