Mini-Batch Primal and Dual Methods for SVMs
Martin Tak\'a\v{c}, Avleen Bijral, Peter Richt\'arik, Nathan, Srebro

TL;DR
This paper investigates how mini-batch techniques affect the efficiency of stochastic optimization methods for SVMs, revealing that data spectral norm governs parallelization speedup and proposing new mini-batched SDCA variants.
Contribution
It introduces novel mini-batched SDCA variants and links parallelization speedup to the spectral norm of data for both primal and dual SVM optimization methods.
Findings
Spectral norm controls parallelization speedup for primal and dual methods.
New mini-batched SDCA variants with theoretical guarantees.
Optimization guarantees are based on the original hinge-loss primal problem.
Abstract
We address the issue of using mini-batches in stochastic optimization of SVMs. We show that the same quantity, the spectral norm of the data, controls the parallelization speedup obtained for both primal stochastic subgradient descent (SGD) and stochastic dual coordinate ascent (SCDA) methods and use it to derive novel variants of mini-batched SDCA. Our guarantees for both methods are expressed in terms of the original nonsmooth primal problem based on the hinge-loss.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Neural Networks and Applications
