The Stochastic Conjugate Subgradient Algorithm For Kernel Support Vector   Machines

Di Zhang; Suvrajeet Sen

arXiv:2407.21091·cs.LG·August 1, 2024

The Stochastic Conjugate Subgradient Algorithm For Kernel Support Vector Machines

Di Zhang, Suvrajeet Sen

PDF

Open Access

TL;DR

This paper introduces a stochastic conjugate subgradient algorithm tailored for kernel SVMs, offering faster convergence, better scalability, and improved accuracy over traditional stochastic first-order methods, especially in large-scale, non-smooth optimization problems.

Contribution

The paper presents a novel stochastic conjugate subgradient method with adaptive sampling and decomposition strategies for kernel SVMs, extending first-order methods to non-smooth, nonlinear problems.

Findings

01

Faster convergence per iteration compared to traditional SFO methods

02

Enhanced scalability in large-scale kernel SVM training

03

Improved speed and accuracy demonstrated in experiments

Abstract

Stochastic First-Order (SFO) methods have been a cornerstone in addressing a broad spectrum of modern machine learning (ML) challenges. However, their efficacy is increasingly questioned, especially in large-scale applications where empirical evidence indicates potential performance limitations. In response, this paper proposes an innovative method specifically designed for kernel support vector machines (SVMs). This method not only achieves faster convergence per iteration but also exhibits enhanced scalability when compared to conventional SFO techniques. Diverging from traditional sample average approximation strategies that typically frame kernel SVM as an 'all-in-one' Quadratic Program (QP), our approach adopts adaptive sampling. This strategy incrementally refines approximation accuracy on an 'as-needed' basis. Crucially, this approach also inspires a decomposition-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Face and Expression Recognition · Machine Learning and ELM

MethodsSupport Vector Machine · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings