Effect of Word Embedding Variable Parameters on Arabic Sentiment Analysis Performance
Anwar Alnawas, Nursal ARICI

TL;DR
This paper investigates how varying word embedding parameters like window size, vector dimension, and negative sampling affect Arabic sentiment analysis performance using different architectures and classifiers.
Contribution
It introduces a detailed analysis of parameter effects on Arabic sentiment analysis with word embeddings, filling a gap in existing research.
Findings
Optimal window size improves classifier accuracy.
Higher vector dimensions enhance sentiment detection.
Negative sampling parameter impacts embedding quality.
Abstract
Social media such as Twitter, Facebook, etc. has led to a generated growing number of comments that contains users opinions. Sentiment analysis research deals with these comments to extract opinions which are positive or negative. Arabic language is a rich morphological language; thus, classical techniques of English sentiment analysis cannot be used for Arabic. Word embedding technique can be considered as one of successful methods to gaping the morphological problem of Arabic. Many works have been done for Arabic sentiment analysis based on word embedding, but there is no study focused on variable parameters. This study will discuss three parameters (Window size, Dimension of vector and Negative Sample) for Arabic sentiment analysis using DBOW and DMPV architectures. A large corpus of previous works generated to learn word representations and extract features. Four binary classifiers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Advanced Text Analysis Techniques
