From Prediction to Justification: Aligning Sentiment Reasoning with Human Rationale via Reinforcement Learning
Shihao Zhang, Ziwei Wang, Jie Zhou, Yulan Wu, Qin Chen, Zhikai Lei, Liyang Yu, Liang Dou, Liang He

TL;DR
This paper presents ABSA-R1, a reinforcement learning framework that enables sentiment analysis models to generate human-like justifications, improving interpretability and accuracy by aligning reasoning with predictions.
Contribution
It introduces a novel RL-based approach that produces explicit natural language explanations for sentiment predictions, bridging the gap between black-box models and human reasoning.
Findings
Enhanced interpretability through natural language justifications.
Improved sentiment classification and triplet extraction performance.
Effective handling of hard cases via a performance-driven rejection sampling.
Abstract
While Aspect-based Sentiment Analysis (ABSA) systems have achieved high accuracy in identifying sentiment polarities, they often operate as "black boxes," lacking the explicit reasoning capabilities characteristic of human affective cognition. Humans do not merely categorize sentiment; they construct causal explanations for their judgments. To bridge this gap, we propose ABSA-R1, a large language model framework designed to mimic this ``reason-before-predict" cognitive process. By leveraging reinforcement learning (RL), ABSA-R1 learns to articulate the why behind the what, generating natural language justifications that ground its sentiment predictions. We introduce a Cognition-Aligned Reward Model (formerly sentiment-aware reward model) that enforces consistency between the generated reasoning path and the final emotional label. Furthermore, inspired by metacognitive monitoring, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
