Polarization Measurement of High Dimensional Social Media Messages With   Support Vector Machine Algorithm Using Mapreduce

Ferhat \"Ozg\"ur \c{C}atak

arXiv:1410.2686·cs.LG·March 12, 2015·1 cites

Polarization Measurement of High Dimensional Social Media Messages With Support Vector Machine Algorithm Using Mapreduce

Ferhat \"Ozg\"ur \c{C}atak

PDF

Open Access

TL;DR

This paper introduces a distributed MapReduce-based SVM training algorithm to efficiently classify large-scale social media messages for polarization analysis, overcoming traditional SVM computational limitations.

Contribution

The paper presents a novel distributed MapReduce approach for training SVMs on large datasets, enabling scalable polarization measurement of social media messages.

Findings

01

Effective SVM training on large social media datasets

02

High classification accuracy demonstrated with Twitter data

03

Scalable method reduces training time for big data

Abstract

In this article, we propose a new Support Vector Machine (SVM) training algorithm based on distributed MapReduce technique. In literature, there are a lots of research that shows us SVM has highest generalization property among classification algorithms used in machine learning area. Also, SVM classifier model is not affected by correlations of the features. But SVM uses quadratic optimization techniques in its training phase. The SVM algorithm is formulated as quadratic optimization problem. Quadratic optimization problem has $O (m^{3})$ time and $O (m^{2})$ space complexity, where m is the training set size. The computation time of SVM training is quadratic in the number of training instances. In this reason, SVM is not a suitable classification algorithm for large scale dataset classification. To solve this training problem we developed a new distributed MapReduce method developed.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies

MethodsSupport Vector Machine