Polarization Measurement of High Dimensional Social Media Messages With Support Vector Machine Algorithm Using Mapreduce
Ferhat \"Ozg\"ur \c{C}atak

TL;DR
This paper introduces a distributed MapReduce-based SVM training algorithm to efficiently classify large-scale social media messages for polarization analysis, overcoming traditional SVM computational limitations.
Contribution
The paper presents a novel distributed MapReduce approach for training SVMs on large datasets, enabling scalable polarization measurement of social media messages.
Findings
Effective SVM training on large social media datasets
High classification accuracy demonstrated with Twitter data
Scalable method reduces training time for big data
Abstract
In this article, we propose a new Support Vector Machine (SVM) training algorithm based on distributed MapReduce technique. In literature, there are a lots of research that shows us SVM has highest generalization property among classification algorithms used in machine learning area. Also, SVM classifier model is not affected by correlations of the features. But SVM uses quadratic optimization techniques in its training phase. The SVM algorithm is formulated as quadratic optimization problem. Quadratic optimization problem has time and space complexity, where m is the training set size. The computation time of SVM training is quadratic in the number of training instances. In this reason, SVM is not a suitable classification algorithm for large scale dataset classification. To solve this training problem we developed a new distributed MapReduce method developed.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies
MethodsSupport Vector Machine
