Scaling SVM and Least Absolute Deviations via Exact Data Reduction
Jie Wang, Peter Wonka, Jieping Ye

TL;DR
This paper introduces a safe, efficient data reduction technique called DVI for SVM and LAD that significantly decreases computational costs by discarding non-support vectors, enabling faster large-scale classification and regression.
Contribution
The paper presents the first screening method for LAD and extends DVI to efficiently identify non-support vectors in SVM and LAD problems.
Findings
DVI can discard up to 99% of data points as non-support vectors.
The method achieves up to 100x speedup in SVM and LAD computations.
DVI outperforms existing screening rules in accuracy and efficiency.
Abstract
The support vector machine (SVM) is a widely used method for classification. Although many efforts have been devoted to develop efficient solvers, it remains challenging to apply SVM to large-scale problems. A nice property of SVM is that the non-support vectors have no effect on the resulting classifier. Motivated by this observation, we present fast and efficient screening rules to discard non-support vectors by analyzing the dual problem of SVM via variational inequalities (DVI). As a result, the number of data instances to be entered into the optimization can be substantially reduced. Some appealing features of our screening method are: (1) DVI is safe in the sense that the vectors discarded by DVI are guaranteed to be non-support vectors; (2) the data set needs to be scanned only once to run the screening, whose computational cost is negligible compared to that of solving the SVM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Sparse and Compressive Sensing Techniques · Multi-Criteria Decision Making
MethodsSupport Vector Machine
