Gradient Norm Regularization Second-Order Algorithms for Solving Nonconvex-Strongly Concave Minimax Problems
Jun-Lin Wang, Zi Xu

TL;DR
This paper introduces second-order algorithms with gradient norm regularization for efficiently solving nonconvex-strongly concave minimax problems, achieving optimal iteration complexity bounds.
Contribution
It proposes a novel gradient norm regularized trust-region algorithm and a Levenberg-Marquardt variant that do not require solving trust-region subproblems, with proven optimal iteration complexities.
Findings
Achieves optimal iteration complexity for second-order stationary points.
Proposes algorithms that do not require solving trust-region subproblems.
Inexact variants maintain convergence with fewer Hessian-vector products.
Abstract
In this paper, we study second-order algorithms for solving nonconvex-strongly concave minimax problems, which have attracted much attention in recent years in many fields, especially in machine learning.We propose a gradient norm regularized trust-region (GRTR) algorithm to solve nonconvex-strongly concave minimax problems, where the objective function of the trust-region subproblem in each iteration uses a regularized version of the Hessian matrix, and the regularization coefficient and the radius of the ball constraint are proportional to the square root of the gradient norm. The iteration complexity of the proposed GRTR algorithm to obtain an -second-order stationary point is proved to be upper bounded by , where is the strong concave coefficient, and are the Lipschitz constant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Numerical methods in inverse problems · Photoacoustic and Ultrasonic Imaging
MethodsSoftmax · Attention Is All You Need
