Benchmarking Logistic Regression, SVM, Naive Bayes, and IndoBERT Fine-Tuning for Sentiment Analysis on Indonesian Product Reviews
Nabila Zakiyah Zahra, Salwa Farhanatussaidah, Nasywa Nur Afifah, Luluk Muthoharoh, Ardika Satria, and Martin C.T. Manullang

TL;DR
This paper compares traditional ML models and IndoBERT for Indonesian sentiment analysis, finding that SVM outperforms IndoBERT on a large e-commerce review dataset, with deployment as a web app.
Contribution
It benchmarks classical ML methods against IndoBERT on Indonesian reviews, highlighting the effectiveness of SVM and addressing class imbalance with custom loss functions.
Findings
Linear SVC achieved 97.60% accuracy, outperforming IndoBERT.
Traditional models outperformed transformer-based models in this setup.
The pipeline was successfully deployed as an interactive web application.
Abstract
The exponential growth of e-commerce platforms in Indonesia has generated a massive volume of user-generated product reviews. Analyzing the sentiment of these reviews is critical for measuring customer satisfaction and identifying product issues at scale. This paper benchmarks traditional Machine Learning (ML) approaches against a Transformer-based Deep Learning model for a three-class sentiment analysis task (positive, neutral, negative) on the Tokopedia Product Reviews 2025 dataset. We implemented Term Frequency-Inverse Document Frequency (TF-IDF) feature extraction coupled with three algorithms: Logistic Regression, Linear Support Vector Machine (SVM), and Multinomial Naive Bayes as robust baselines. Subsequently, we fine-tuned the IndoBERT model (indobenchmark/indobert-base-p1) for contextual sequence classification. To computationally address the severe class imbalance inherent in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
