SentiBench - a benchmark comparison of state-of-the-practice sentiment analysis methods
Filipe Nunes Ribeiro, Matheus Ara\'ujo, Pollyanna Gon\c{c}alves,, Fabr\'icio Benevenuto, Marcos Andr\'e Gon\c{c}alves

TL;DR
This paper provides a comprehensive benchmark comparison of 24 popular sentiment analysis methods across 18 diverse datasets, revealing significant performance variability and offering an open platform for evaluation and development.
Contribution
It introduces the first extensive apple-to-apple benchmark of sentiment analysis methods in practical settings, with open code and datasets for ongoing research.
Findings
Performance varies greatly across datasets
Supervised methods generally outperform lexical-based approaches
Open API facilitates future method development
Abstract
In the last few years thousands of scientific papers have investigated sentiment analysis, several startups that measure opinions on real data have emerged and a number of innovative products related to this theme have been developed. There are multiple methods for measuring sentiments, including lexical-based and supervised machine learning methods. Despite the vast interest on the theme and wide popularity of some methods, it is unclear which one is better for identifying the polarity (i.e., positive or negative) of a message. Accordingly, there is a strong need to conduct a thorough apple-to-apple comparison of sentiment analysis methods, \textit{as they are used in practice}, across multiple datasets originated from different data sources. Such a comparison is key for understanding the potential limitations, advantages, and disadvantages of popular methods. This article aims at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Advanced Text Analysis Techniques
