A Divide-and-Conquer Solver for Kernel Support Vector Machines
Cho-Jui Hsieh, Si Si, Inderjit S. Dhillon

TL;DR
This paper introduces a divide-and-conquer approach for kernel SVMs that significantly speeds up training on large datasets while maintaining high accuracy, by partitioning data and efficiently combining solutions.
Contribution
The paper proposes a novel multilevel divide-and-conquer algorithm for kernel SVMs, with theoretical support and practical improvements over existing methods.
Findings
DC-SVM is 7 times faster than LIBSVM on large datasets.
The method achieves 96.15% accuracy with much less training time.
Early prediction strategy reduces training time by over 100 times.
Abstract
The kernel support vector machine (SVM) is one of the most widely used classification methods; however, the amount of computation required becomes the bottleneck when facing millions of samples. In this paper, we propose and analyze a novel divide-and-conquer solver for kernel SVMs (DC-SVM). In the division step, we partition the kernel SVM problem into smaller subproblems by clustering the data, so that each subproblem can be solved independently and efficiently. We show theoretically that the support vectors identified by the subproblem solution are likely to be support vectors of the entire kernel SVM problem, provided that the problem is partitioned appropriately by kernel clustering. In the conquer step, the local solutions from the subproblems are used to initialize a global coordinate descent solver, which converges quickly as suggested by our analysis. By extending this idea, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Face and Expression Recognition · Text and Document Classification Technologies
MethodsSupport Vector Machine
