The Stochastic Conjugate Subgradient Algorithm For Kernel Support Vector Machines
Di Zhang, Suvrajeet Sen

TL;DR
This paper introduces a stochastic conjugate subgradient algorithm tailored for kernel SVMs, offering faster convergence, better scalability, and improved accuracy over traditional stochastic first-order methods, especially in large-scale, non-smooth optimization problems.
Contribution
The paper presents a novel stochastic conjugate subgradient method with adaptive sampling and decomposition strategies for kernel SVMs, extending first-order methods to non-smooth, nonlinear problems.
Findings
Faster convergence per iteration compared to traditional SFO methods
Enhanced scalability in large-scale kernel SVM training
Improved speed and accuracy demonstrated in experiments
Abstract
Stochastic First-Order (SFO) methods have been a cornerstone in addressing a broad spectrum of modern machine learning (ML) challenges. However, their efficacy is increasingly questioned, especially in large-scale applications where empirical evidence indicates potential performance limitations. In response, this paper proposes an innovative method specifically designed for kernel support vector machines (SVMs). This method not only achieves faster convergence per iteration but also exhibits enhanced scalability when compared to conventional SFO techniques. Diverging from traditional sample average approximation strategies that typically frame kernel SVM as an 'all-in-one' Quadratic Program (QP), our approach adopts adaptive sampling. This strategy incrementally refines approximation accuracy on an 'as-needed' basis. Crucially, this approach also inspires a decomposition-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Face and Expression Recognition · Machine Learning and ELM
MethodsSupport Vector Machine · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
