Training Support Vector Machines Using Frank-Wolfe Optimization Methods

Emanuele Frandi; Ricardo Nanculef; Maria Grazia Gasparo; Stefano Lodi,; Claudio Sartori

arXiv:1212.0695·cs.LG·January 29, 2014

Training Support Vector Machines Using Frank-Wolfe Optimization Methods

Emanuele Frandi, Ricardo Nanculef, Maria Grazia Gasparo, Stefano Lodi,, Claudio Sartori

PDF

TL;DR

This paper introduces two novel Frank-Wolfe based algorithms for training SVMs that improve scalability and efficiency over traditional CVM methods, especially on large datasets, with comparable accuracy.

Contribution

The paper proposes new Frank-Wolfe algorithms for SVM training that avoid complex quadratic programming, enhancing scalability and applicability to a broader range of kernels.

Findings

01

Algorithms scale better than CVMs on large datasets

02

Methods require only analytic optimization steps

03

Achieve comparable accuracy with wider kernel applicability

Abstract

Training a Support Vector Machine (SVM) requires the solution of a quadratic programming problem (QP) whose computational complexity becomes prohibitively expensive for large scale datasets. Traditional optimization methods cannot be directly applied in these cases, mainly due to memory restrictions. By adopting a slightly different objective function and under mild conditions on the kernel used within the model, efficient algorithms to train SVMs have been devised under the name of Core Vector Machines (CVMs). This framework exploits the equivalence of the resulting learning problem with the task of building a Minimal Enclosing Ball (MEB) problem in a feature space, where data is implicitly embedded by a kernel function. In this paper, we improve on the CVM approach by proposing two novel methods to build SVMs based on the Frank-Wolfe algorithm, recently revisited as a fast method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.