A Quasi-Newton Approach to Nonsmooth Convex Optimization Problems in Machine Learning
Jin Yu, S.V.N. Vishwanathan, Simon Guenter, and Nicol N. Schraudolph

TL;DR
This paper generalizes the BFGS quasi-Newton method for nonsmooth convex optimization, proving convergence and demonstrating competitive performance on machine learning tasks like risk minimization with hinge and logistic losses.
Contribution
It extends BFGS and LBFGS to nonsmooth convex problems, introduces an efficient line search for multiclass/multilabel settings, and proves convergence under certain conditions.
Findings
SubBFGS converges globally in objective value.
Our line search algorithm has proven worst-case complexity bounds.
Methods outperform or match state-of-the-art solvers on various datasets.
Abstract
We extend the well-known BFGS quasi-Newton method and its memory-limited variant LBFGS to the optimization of nonsmooth convex objectives. This is done in a rigorous fashion by generalizing three components of BFGS to subdifferentials: the local quadratic model, the identification of a descent direction, and the Wolfe line search conditions. We prove that under some technical conditions, the resulting subBFGS algorithm is globally convergent in objective function value. We apply its memory-limited variant (subLBFGS) to L_2-regularized risk minimization with the binary hinge loss. To extend our algorithm to the multiclass and multilabel settings, we develop a new, efficient, exact line search algorithm. We prove its worst-case time complexity bounds, and show that our line search can also be used to extend a recently developed bundle method to the multiclass and multilabel settings. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research · Machine Learning and Algorithms
