TL;DR
This paper introduces SP-BFGS, a noise-robust quasi-Newton method that adaptively relaxes the secant condition using penalization, improving optimization performance under noisy gradient measurements.
Contribution
The paper proposes a novel secant penalized BFGS method that interpolates between updating and not updating the inverse Hessian, enhancing robustness to noisy gradients.
Findings
SP-BFGS outperforms BFGS in noisy environments.
The method maintains convergence properties under bounded noise.
Numerical experiments confirm improved stability and accuracy.
Abstract
In this paper, we introduce a new variant of the BFGS method designed to perform well when gradient measurements are corrupted by noise. We show that by treating the secant condition with a penalty method approach motivated by regularized least squares estimation, one can smoothly interpolate between updating the inverse Hessian approximation with the original BFGS update formula and not updating the inverse Hessian approximation. Furthermore, we find the curvature condition is smoothly relaxed as the interpolation moves towards not updating the inverse Hessian approximation, disappearing entirely when the inverse Hessian approximation is not updated. These developments allow us to develop a method we refer to as secant penalized BFGS (SP-BFGS) that allows one to relax the secant condition based on the amount of noise in the gradient measurements. SP-BFGS provides a means of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
