Bidirectional Encoder Representations from Transformers (BERT): A sentiment analysis odyssey
Shivaji Alaparthi (Data Scientist, CenturyLink, Bengaluru, India) and, Manit Mishra (Associate Professor, International Management Institute, Bhubaneswar, India)

TL;DR
This study compares four sentiment analysis techniques, demonstrating that BERT significantly outperforms traditional models like LSTM, logistic regression, and lexicon-based methods on movie review data.
Contribution
It is the first to empirically compare BERT with other sentiment analysis models, highlighting its superior effectiveness in text classification tasks.
Findings
BERT achieves the highest accuracy, precision, recall, and F1 scores.
Pre-trained deep learning models outperform traditional and lexicon-based methods.
BERT's superiority is consistent across evaluation metrics.
Abstract
The purpose of the study is to investigate the relative effectiveness of four different sentiment analysis techniques: (1) unsupervised lexicon-based model using Sent WordNet; (2) traditional supervised machine learning model using logistic regression; (3) supervised deep learning model using Long Short-Term Memory (LSTM); and, (4) advanced supervised deep learning models using Bidirectional Encoder Representations from Transformers (BERT). We use publicly available labeled corpora of 50,000 movie reviews originally posted on internet movie database (IMDB) for analysis using Sent WordNet lexicon, logistic regression, LSTM, and BERT. The first three models were run on CPU based system whereas BERT was run on GPU based system. The sentiment classification performance was evaluated based on accuracy, precision, recall, and F1 score. The study puts forth two key insights: (1) relative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Stock Market Forecasting Methods · Topic Modeling
MethodsLinear Layer · Tanh Activation · Sigmoid Activation · Multi-Head Attention · Residual Connection · Attention Is All You Need · Attention Dropout · Weight Decay · Adam · Long Short-Term Memory
