A Multi-Model Approach to English-Bangla Sentiment Classification of Government Mobile Banking App Reviews
Md. Naim Molla, Md Muhtasim Munif Fahim, Md. Binyamin, Md Jahid Hasan Imran, Tonmoy Shil, Nura Rayhan, and Md Rezaul Karim

TL;DR
This study compares traditional and transformer-based models for sentiment analysis of Bangladeshi government banking app reviews, revealing classical models outperform transformers and highlighting language resource gaps.
Contribution
It introduces a hybrid labeling approach and provides a comprehensive analysis of sentiment classification performance across models and languages.
Findings
Classical models like Random Forest achieved higher accuracy than transformers.
Transformers showed no significant advantage over traditional models in this context.
Significant performance gap exists between Bangla and English sentiment analysis accuracy.
Abstract
For millions of users in developing economies who depend on mobile banking as their primary gateway to financial services, app quality directly shapes financial access. The study analyzed 5,652 Google Play reviews in English and Bangla (filtered from 11,414 raw reviews) for four Bangladeshi government banking apps. The authors used a hybrid labeling approach that combined use of the reviewer's star rating for each review along with a separate independent XLM-RoBERTa classifier to produce moderate inter-method agreement (kappa = 0.459). Traditional models outperformed transformer-based ones: Random Forest produced the highest accuracy (0.815), while Linear SVM produced the highest weighted F1 score (0.804); both were higher than the performance of fine-tuned XLM-RoBERTa (0.793). McNemar's test confirmed that all classical models were significantly superior to the off-the-shelf…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
