LSEMINK: A Modified Newton-Krylov Method for Log-Sum-Exp Minimization
Kelvin Kan, James G. Nagy, Lars Ruthotto

TL;DR
LSEMINK is a modified Newton-Krylov algorithm designed for efficient and scalable minimization of the log-sum-exp function, with proven convergence and improved performance in large-scale machine learning and geometric programming tasks.
Contribution
The paper introduces LSEMINK, a novel modified Newton-Krylov method that ensures bounded quadratic approximations and demonstrates superior scalability and robustness for log-sum-exp minimization.
Findings
Reduces time-to-solution in large-scale problems
Achieves faster initial convergence than standard methods
More robust to ill-conditioning
Abstract
This paper introduces LSEMINK, an effective modified Newton-Krylov algorithm geared toward minimizing the log-sum-exp function for a linear model. Problems of this kind arise commonly, for example, in geometric programming and multinomial logistic regression. Although the log-sum-exp function is smooth and convex, standard line search Newton-type methods can become inefficient because the quadratic approximation of the objective function can be unbounded from below. To circumvent this, LSEMINK modifies the Hessian by adding a shift in the row space of the linear model. We show that the shift renders the quadratic approximation to be bounded from below and that the overall scheme converges to a global minimizer under mild assumptions. Our convergence proof also shows that all iterates are in the row space of the linear model, which can be attractive when the model parameters do not have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Advanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques
