Training Support Vector Machines Using Frank-Wolfe Optimization Methods
Emanuele Frandi, Ricardo Nanculef, Maria Grazia Gasparo, Stefano Lodi,, Claudio Sartori

TL;DR
This paper introduces two novel Frank-Wolfe based algorithms for training SVMs that improve scalability and efficiency over traditional CVM methods, especially on large datasets, with comparable accuracy.
Contribution
The paper proposes new Frank-Wolfe algorithms for SVM training that avoid complex quadratic programming, enhancing scalability and applicability to a broader range of kernels.
Findings
Algorithms scale better than CVMs on large datasets
Methods require only analytic optimization steps
Achieve comparable accuracy with wider kernel applicability
Abstract
Training a Support Vector Machine (SVM) requires the solution of a quadratic programming problem (QP) whose computational complexity becomes prohibitively expensive for large scale datasets. Traditional optimization methods cannot be directly applied in these cases, mainly due to memory restrictions. By adopting a slightly different objective function and under mild conditions on the kernel used within the model, efficient algorithms to train SVMs have been devised under the name of Core Vector Machines (CVMs). This framework exploits the equivalence of the resulting learning problem with the task of building a Minimal Enclosing Ball (MEB) problem in a feature space, where data is implicitly embedded by a kernel function. In this paper, we improve on the CVM approach by proposing two novel methods to build SVMs based on the Frank-Wolfe algorithm, recently revisited as a fast method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
