Statistical sentiment analysis performance in Opinum

Boyan Bonev; Gema Ram\'irez-S\'anchez; Sergio Ortiz Rojas

arXiv:1303.0446·cs.CL·March 5, 2013

Statistical sentiment analysis performance in Opinum

Boyan Bonev, Gema Ram\'irez-S\'anchez, Sergio Ortiz Rojas

PDF

Open Access

TL;DR

This paper evaluates the Opinum statistical approach for sentiment analysis, which models word order without syntactic or semantic info, achieving over 81% accuracy on Spanish financial opinions.

Contribution

It introduces a simple probabilistic model based on word order, lemmatization, and entity replacement, demonstrating effective sentiment classification without complex linguistic features.

Findings

01

Achieves over 81% accuracy on Spanish financial opinions

02

Highlights importance of lemmatization and entity replacement

03

Discusses factors impacting classification performance

Abstract

The classification of opinion texts in positive and negative is becoming a subject of great interest in sentiment analysis. The existence of many labeled opinions motivates the use of statistical and machine-learning methods. First-order statistics have proven to be very limited in this field. The Opinum approach is based on the order of the words without using any syntactic and semantic information. It consists of building one probabilistic model for the positive and another one for the negative opinions. Then the test opinions are compared to both models and a decision and confidence measure are calculated. In order to reduce the complexity of the training corpus we first lemmatize the texts and we replace most named-entities with wildcards. Opinum presents an accuracy above 81% for Spanish opinions in the financial products domain. In this work we discuss which are the most important…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Natural Language Processing Techniques · Topic Modeling