Training Support Vector Machines using Coresets

Cenk Baykal; Lucas Liebenwein; Wilko Schwarting

arXiv:1708.03835 · cs.DS · November 13, 2017 · 2 cites

Open Access

TL;DR

This paper introduces a coreset construction algorithm that selects a small weighted subset of the training data, so that a Support Vector Machine can be trained on the subset much faster while provably approximating training on the full data set.

Contribution

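The paper presents an importance sampling-based coreset construction, driven by the sensitivity of each data point, and shows that coresets of size polylogarithmic in $n$ and polynomial in $d$ exist for $n$ input points with $d$ features.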

Findings

1. Outperforms existing coreset methods in speed and accuracy

2. Achieves low approximation error with smaller coresets

3. Enables faster SVM training on large datasets

Abstract

We present a novel coreset construction algorithm for solving classification tasks using Support Vector Machines (SVMs) in a computationally efficient manner. A coreset is a weighted subset of the original data points that provably approximates the original set. We show that coresets of size polylogarithmic in $n$ and polynomial in $d$ exist for a set of $n$ input points with $d$ features and present an $(\epsilon, \delta)$-FPRAS for constructing coresets for scalable SVM training. Our method leverages the insight that data points are often redundant and uses an importance sampling scheme based on the sensitivity of each data point to construct coresets efficiently. We evaluate the performance of our algorithm in accelerating SVM training against real-world data sets and compare our algorithm to state-of-the-art coreset approaches. Our empirical results show that our approach outperforms…
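
The importance sampling idea in the abstract can be sketched in a few lines of Python. This is an illustration only, not the paper's algorithm: the abstract does not state the sensitivity bound, so `sensitivity_proxy` below is a hypothetical stand-in (a uniform term plus each point's share of the distance to its class mean), and the names `sensitivity_proxy` and `build_coreset` are assumptions made for this example. What the sketch does show is the generic pattern: sample points with probability proportional to an importance score, then reweight each sampled point by the inverse of its sampling probability.

```python
import math
import random


def sensitivity_proxy(points, labels):
    """Hypothetical per-point importance scores (NOT the paper's bound).

    Each score mixes a uniform term 1/n with the point's share of the total
    distance to its own class mean, so outlying points score higher.
    """
    n = len(points)
    dim = len(points[0])
    # Mean of each class, computed coordinate by coordinate.
    means = {}
    for label in set(labels):
        members = [p for p, y in zip(points, labels) if y == label]
        means[label] = [sum(p[j] for p in members) / len(members) for j in range(dim)]
    dists = [math.dist(p, means[y]) for p, y in zip(points, labels)]
    total_dist = sum(dists)
    if total_dist == 0.0:
        # All points coincide with their class mean: fall back to uniform.
        return [1.0 / n] * n
    return [1.0 / n + d / total_dist for d in dists]


def build_coreset(points, labels, m, rng):
    """Draw m indices with probability proportional to the scores and
    return (indices, weights) describing the weighted subset."""
    scores = sensitivity_proxy(points, labels)
    total = sum(scores)
    probs = [s / total for s in scores]
    draws = rng.choices(range(len(points)), weights=probs, k=m)
    # Each draw of point i adds 1 / (m * p_i); repeated draws accumulate,
    # which keeps weighted sums unbiased estimates of the full-data sums.
    weight_of = {}
    for i in draws:
        weight_of[i] = weight_of.get(i, 0.0) + 1.0 / (m * probs[i])
    indices = sorted(weight_of)
    return indices, [weight_of[i] for i in indices]


if __name__ == "__main__":
    rng = random.Random(0)
    points = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(1000)]
    labels = [1 if x + y > 0 else -1 for x, y in points]
    indices, weights = build_coreset(points, labels, m=100, rng=rng)
    print(len(indices), "distinct points kept out of", len(points))
```

The returned weights could then be handed to any SVM solver that accepts per-sample weights, for example through the `sample_weight` argument of scikit-learn's `SVC.fit`. Whether the trained model approximates the full-data solution depends entirely on the quality of the scores, which is where the paper's sensitivity analysis would replace the proxy used here.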

Taxonomy

Topics: Neural Networks and Applications · Machine Learning and Data Classification · Face and Expression Recognition

Methods: Coresets · Support Vector Machine