Sentiment Analysis of Lithuanian Online Reviews Using Large Language Models
Brigita Vileikyt\.e, Mantas Luko\v{s}evi\v{c}ius, Lukas, Stankevi\v{c}ius

TL;DR
This paper explores the use of fine-tuned multilingual Large Language Models, specifically BERT and T5, for sentiment analysis of Lithuanian online reviews, achieving high accuracy and outperforming GPT-4.
Contribution
It is the first to apply transformer models to Lithuanian sentiment analysis, demonstrating their effectiveness over traditional methods and commercial models.
Findings
Fine-tuned models achieved 80.74% accuracy on one-star reviews.
Fine-tuned models achieved 89.61% accuracy on five-star reviews.
Models outperform GPT-4 in sentiment recognition accuracy.
Abstract
Sentiment analysis is a widely researched area within Natural Language Processing (NLP), attracting significant interest due to the advent of automated solutions. Despite this, the task remains challenging because of the inherent complexity of languages and the subjective nature of sentiments. It is even more challenging for less-studied and less-resourced languages such as Lithuanian. Our review of existing Lithuanian NLP research reveals that traditional machine learning methods and classification algorithms have limited effectiveness for the task. In this work, we address sentiment analysis of Lithuanian five-star-based online reviews from multiple domains that we collect and clean. We apply transformer models to this task for the first time, exploring the capabilities of pre-trained multilingual Large Language Models (LLMs), specifically focusing on fine-tuning BERT and T5 models.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Computational and Text Analysis Methods · Topic Modeling
MethodsAttention Is All You Need · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Absolute Position Encodings · Inverse Square Root Schedule · Dropout · Label Smoothing · WordPiece · Transformer · Weight Decay
