Hessian Initialization Strategies for L-BFGS Solving Non-linear Inverse Problems
Hari Om Aggrawal, Jan Modersitzki

TL;DR
This paper introduces a new Hessian initialization method for L-BFGS that improves convergence speed and solution quality in large-scale inverse problems by effectively approximating the data-fidelity Hessian.
Contribution
The authors propose a novel Hessian initialization strategy using scalar approximations satisfying secant equations, enhancing L-BFGS performance in inverse problems.
Findings
Faster convergence in convex quadratic problems.
Improved solution quality over existing strategies.
Effective in non-convex image registration tasks.
Abstract
L-BFGS is the state-of-the-art optimization method for many large scale inverse problems. It has a small memory footprint and achieves superlinear convergence. The method approximates Hessian based on an initial approximation and an update rule that models current local curvature information. The initial approximation greatly affects the scaling of a search direction and the overall convergence of the method. We propose a novel, simple, and effective way to initialize the Hessian. Typically, the objective function is a sum of a data-fidelity term and a regularizer. Often, the Hessian of the data-fidelity is computationally challenging, but the regularizer's Hessian is easy to compute. We replace the Hessian of the data-fidelity with a scalar and keep the Hessian of the regularizer to initialize the Hessian approximation at every iteration. The scalar satisfies the secant equation in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research · Numerical methods in inverse problems
