A Novel Approach to Distributed Multi-Class SVM
Aruna Govada, Shree Ranjani, Aditi Viswanathan, S.K.Sahay

TL;DR
This paper introduces a distributed multi-class SVM algorithm that leverages a divide-and-conquer strategy in Hadoop, significantly reducing prediction time and improving accuracy on large datasets compared to traditional methods.
Contribution
It presents a novel distributed multi-class SVM algorithm that efficiently handles large datasets using a recursive divide-and-conquer approach in Hadoop.
Findings
Reduces prediction time compared to traditional SVMs
Achieves higher classification accuracy on large datasets
Demonstrates scalability with increasing data size
Abstract
With data sizes constantly expanding, and with classical machine learning algorithms that analyze such data requiring larger and larger amounts of computation time and storage space, the need to distribute computation and memory requirements among several computers has become apparent. Although substantial work has been done in developing distributed binary SVM algorithms and multi-class SVM algorithms individually, the field of multi-class distributed SVMs remains largely unexplored. This research proposes a novel algorithm that implements the Support Vector Machine over a multi-class dataset and is efficient in a distributed environment (here, Hadoop). The idea is to divide the dataset into half recursively and thus compute the optimal Support Vector Machine for this half during the training phase, much like a divide and conquer approach. While testing, this structure has been…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSupport Vector Machine
