A Precisely Xtreme-Multi Channel Hybrid Approach For Roman Urdu Sentiment Analysis
Faiza Memood, Muhammad Usman Ghani, Muhammad Ali Ibrahim, Rehab, Shehzadi, Muhammad Nabeel Asim

TL;DR
This paper introduces a new Roman Urdu sentiment analysis framework utilizing three neural word embeddings, a novel dataset, and a hybrid multi-channel approach that significantly outperforms existing methods in F1-score.
Contribution
It presents the first Roman Urdu dataset, develops neural embeddings with intrinsic and extrinsic evaluation, and proposes a novel hybrid approach that improves sentiment analysis accuracy.
Findings
Neural embeddings evaluated successfully with intrinsic and extrinsic methods.
The hybrid approach outperforms state-of-the-art methods by 9% in F1-score.
Benchmark dataset enables future research in Roman Urdu sentiment analysis.
Abstract
In order to accelerate the performance of various Natural Language Processing tasks for Roman Urdu, this paper for the very first time provides 3 neural word embeddings prepared using most widely used approaches namely Word2vec, FastText, and Glove. The integrity of generated neural word embeddings is evaluated using intrinsic and extrinsic evaluation approaches. Considering the lack of publicly available benchmark datasets, it provides a first-ever Roman Urdu dataset which consists of 3241 sentiments annotated against positive, negative and neutral classes. To provide benchmark baseline performance over the presented dataset, we adapt diverse machine learning (Support Vector Machine Logistic Regression, Naive Bayes), deep learning (convolutional neural network, recurrent neural network), and hybrid approaches. Effectiveness of generated neural word embeddings is evaluated by comparing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining
MethodsLogistic Regression
