Effect of Word Embedding Variable Parameters on Arabic Sentiment   Analysis Performance

Anwar Alnawas; Nursal ARICI

arXiv:2101.02906·cs.CL·January 11, 2021·1 cites

Effect of Word Embedding Variable Parameters on Arabic Sentiment Analysis Performance

Anwar Alnawas, Nursal ARICI

PDF

Open Access

TL;DR

This paper investigates how varying word embedding parameters like window size, vector dimension, and negative sampling affect Arabic sentiment analysis performance using different architectures and classifiers.

Contribution

It introduces a detailed analysis of parameter effects on Arabic sentiment analysis with word embeddings, filling a gap in existing research.

Findings

01

Optimal window size improves classifier accuracy.

02

Higher vector dimensions enhance sentiment detection.

03

Negative sampling parameter impacts embedding quality.

Abstract

Social media such as Twitter, Facebook, etc. has led to a generated growing number of comments that contains users opinions. Sentiment analysis research deals with these comments to extract opinions which are positive or negative. Arabic language is a rich morphological language; thus, classical techniques of English sentiment analysis cannot be used for Arabic. Word embedding technique can be considered as one of successful methods to gaping the morphological problem of Arabic. Many works have been done for Arabic sentiment analysis based on word embedding, but there is no study focused on variable parameters. This study will discuss three parameters (Window size, Dimension of vector and Negative Sample) for Arabic sentiment analysis using DBOW and DMPV architectures. A large corpus of previous works generated to learn word representations and extract features. Four binary classifiers…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Advanced Text Analysis Techniques