A Quasi-Newton Approach to Nonsmooth Convex Optimization Problems in   Machine Learning

Jin Yu; S.V.N. Vishwanathan; Simon Guenter; and Nicol N. Schraudolph

arXiv:0804.3835·stat.ML·November 30, 2010·J. Mach. Learn. Res.·2 cites

A Quasi-Newton Approach to Nonsmooth Convex Optimization Problems in Machine Learning

Jin Yu, S.V.N. Vishwanathan, Simon Guenter, and Nicol N. Schraudolph

PDF

Open Access

TL;DR

This paper generalizes the BFGS quasi-Newton method for nonsmooth convex optimization, proving convergence and demonstrating competitive performance on machine learning tasks like risk minimization with hinge and logistic losses.

Contribution

It extends BFGS and LBFGS to nonsmooth convex problems, introduces an efficient line search for multiclass/multilabel settings, and proves convergence under certain conditions.

Findings

01

SubBFGS converges globally in objective value.

02

Our line search algorithm has proven worst-case complexity bounds.

03

Methods outperform or match state-of-the-art solvers on various datasets.

Abstract

We extend the well-known BFGS quasi-Newton method and its memory-limited variant LBFGS to the optimization of nonsmooth convex objectives. This is done in a rigorous fashion by generalizing three components of BFGS to subdifferentials: the local quadratic model, the identification of a descent direction, and the Wolfe line search conditions. We prove that under some technical conditions, the resulting subBFGS algorithm is globally convergent in objective function value. We apply its memory-limited variant (subLBFGS) to L_2-regularized risk minimization with the binary hinge loss. To extend our algorithm to the multiclass and multilabel settings, we develop a new, efficient, exact line search algorithm. We prove its worst-case time complexity bounds, and show that our line search can also be used to extend a recently developed bundle method to the multiclass and multilabel settings. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research · Machine Learning and Algorithms