Enhancing Sentiment Analysis in Bengali Texts: A Hybrid Approach Using Lexicon-Based Algorithm and Pretrained Language Model Bangla-BERT
Hemal Mahmud, Hasan Mahmud, Mohammad Rifat Ahmmad Rashid

TL;DR
This paper presents a hybrid sentiment analysis method for Bengali texts that combines a lexicon-based algorithm with a pre-trained language model, outperforming individual models in accuracy and nuanced classification.
Contribution
The study introduces a novel hybrid approach integrating rule-based algorithms with Bangla-BERT for improved Bengali sentiment analysis, including a new dataset and a sentiment scoring algorithm.
Findings
Hybrid approach outperforms standalone BanglaBERT in accuracy.
Constructed a new dataset with 15,000 labeled reviews.
Achieved better classification across nine sentiment categories.
Abstract
Sentiment analysis (SA) is a process of identifying the emotional tone or polarity within a given text and aims to uncover the user's complex emotions and inner feelings. While sentiment analysis has been extensively studied for languages like English, research in Bengali, remains limited, particularly for fine-grained sentiment categorization. This work aims to connect this gap by developing a novel approach that integrates rule-based algorithms with pre-trained language models. We developed a dataset from scratch, comprising over 15,000 manually labeled reviews. Next, we constructed a Lexicon Data Dictionary, assigning polarity scores to the reviews. We developed a novel rule based algorithm Bangla Sentiment Polarity Score (BSPS), an approach capable of generating sentiment scores and classifying reviews into nine distinct sentiment categories. To assess the performance of this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques · Topic Modeling
