IUST at SemEval-2020 Task 9: Sentiment Analysis for Code-Mixed Social Media Text using Deep Neural Networks and Linear Baselines
Soroush Javdan, Taha Shangipour ataei, Behrouz Minaei-Bidgoli

TL;DR
This paper presents a system for sentiment analysis of code-mixed social media texts using deep neural networks and linear baselines, achieving competitive F1 scores in a shared task.
Contribution
The paper introduces a combination of preprocessing techniques and models, including deep neural networks and linear baselines, for sentiment analysis of code-mixed social media data.
Findings
F1 score of 0.751 for Spanish-English sub-task
F1 score of 0.706 for Hindi-English sub-task
Effective use of deep neural networks and linear models
Abstract
Sentiment Analysis is a well-studied field of Natural Language Processing. However, the rapid growth of social media and noisy content within them poses significant challenges in addressing this problem with well-established methods and tools. One of these challenges is code-mixing, which means using different languages to convey thoughts in social media texts. Our group, with the name of IUST(username: TAHA), participated at the SemEval-2020 shared task 9 on Sentiment Analysis for Code-Mixed Social Media Text, and we have attempted to develop a system to predict the sentiment of a given code-mixed tweet. We used different preprocessing techniques and proposed to use different methods that vary from NBSVM to more complicated deep neural network models. Our best performing method obtains an F1 score of 0.751 for the Spanish-English sub-task and 0.706 over the Hindi-English sub-task.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
