Preconditioned Proximal Gradient Methods with Conjugate Momentum: A Subspace Perspective
Jian Chen, Xinmin Yang

TL;DR
This paper introduces a novel subspace proximal Newton method with conjugate momentum for composite optimization, leveraging structure-exploiting preconditioners and low-dimensional subspace curvature information to enhance convergence on high-dimensional, ill-conditioned problems.
Contribution
It develops a subspace proximal Newton framework that combines preconditioning, momentum, and orthogonalization to improve convergence efficiency in composite optimization.
Findings
Global convergence is established.
Achieves Q-linear convergence under strong convexity.
Numerical experiments show superior performance on high-dimensional problems.
Abstract
In this paper, we propose a descent method for composite optimization problems with linear operators. Specifically, we first design a structure-exploiting preconditioner tailored to the linear operator so that the resulting preconditioned proximal subproblem admits a closed-form solution through its dual formulation. However, such a structure-driven preconditioner may be poorly aligned with the local curvature of the smooth component, which can lead to slow practical convergence. To address this issue, we develop a subspace proximal Newton framework that incorporates curvature information within a low-dimensional subspace. At each iteration, the search direction is obtained by minimizing a proximal Newton model restricted to a two-dimensional subspace spanned by the current preconditioned proximal gradient direction and a momentum direction derived from the previous iterate. By…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research · Matrix Theory and Algorithms
