Approximate Stochastic Subgradient Estimation Training for Support Vector Machines
Sangkyun Lee, Stephen J. Wright

TL;DR
This paper introduces efficient stochastic subgradient methods for training support vector machines with nonlinear kernels, using randomized approximations and reduced primal formulations, achieving comparable accuracy faster.
Contribution
It presents novel subgradient algorithms that handle nonlinear kernels without strong convexity, utilizing randomized low-dimensional kernel approximations and a reduced primal approach.
Findings
Achieves similar prediction accuracy as existing SVM solvers
Reduces training time significantly
Develops prediction schemes independent of support vector count
Abstract
Subgradient algorithms for training support vector machines have been quite successful for solving large-scale and online learning problems. However, they have been restricted to linear kernels and strongly convex formulations. This paper describes efficient subgradient approaches without such limitations. Our approaches make use of randomized low-dimensional approximations to nonlinear kernels, and minimization of a reduced primal formulation using an algorithm based on robust stochastic approximation, which do not require strong convexity. Experiments illustrate that our approaches produce solutions of comparable prediction accuracy with the solutions acquired from existing SVM solvers, but often in much shorter time. We also suggest efficient prediction schemes that depend only on the dimension of kernel approximation, not on the number of support vectors.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Stochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research
