From Prediction to Justification: Aligning Sentiment Reasoning with Human Rationale via Reinforcement Learning

Shihao Zhang; Ziwei Wang; Jie Zhou; Yulan Wu; Qin Chen; Zhikai Lei; Liyang Yu; Liang Dou; Liang He

arXiv:2604.13398·cs.CL·April 16, 2026


TL;DR

This paper presents ABSA-R1, a reinforcement learning framework that enables sentiment analysis models to generate human-like justifications, improving interpretability and accuracy by aligning reasoning with predictions.

Contribution

It introduces a novel RL-based approach that produces explicit natural language explanations for sentiment predictions, bridging the gap between black-box models and human reasoning.

Findings

01 Enhanced interpretability through natural language justifications.

02 Improved sentiment classification and triplet extraction performance.

03 Effective handling of hard cases via performance-driven rejection sampling.

Abstract

While Aspect-based Sentiment Analysis (ABSA) systems have achieved high accuracy in identifying sentiment polarities, they often operate as "black boxes," lacking the explicit reasoning capabilities characteristic of human affective cognition. Humans do not merely categorize sentiment; they construct causal explanations for their judgments. To bridge this gap, we propose ABSA-R1, a large language model framework designed to mimic this "reason-before-predict" cognitive process. By leveraging reinforcement learning (RL), ABSA-R1 learns to articulate the why behind the what, generating natural language justifications that ground its sentiment predictions. We introduce a Cognition-Aligned Reward Model (formerly sentiment-aware reward model) that enforces consistency between the generated reasoning path and the final emotional label. Furthermore, inspired by metacognitive monitoring, we…
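
The abstract describes a reward that ties the generated reasoning to the final label but, as excerpted here, does not say how that reward is computed. The toy sketch below only illustrates the general idea of scoring both label correctness and reasoning/label consistency. Everything in it is an assumption made for illustration: the cue-word lexicon, the names `implied_polarity` and `toy_reward`, and the weights `w_correct` and `w_consistent` are hypothetical and are not taken from ABSA-R1.

```python
import re

# Hypothetical cue lexicon (an assumption for this sketch); a real system
# would more plausibly use a learned scorer than word counting.
POSITIVE_CUES = {"great", "good", "delicious", "friendly", "love"}
NEGATIVE_CUES = {"bad", "slow", "rude", "bland", "terrible"}


def implied_polarity(reasoning: str) -> str:
    """Guess which polarity a reasoning text argues for by counting cue words."""
    tokens = re.findall(r"[a-z']+", reasoning.lower())
    score = sum(t in POSITIVE_CUES for t in tokens) - sum(t in NEGATIVE_CUES for t in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def toy_reward(reasoning: str, predicted: str, gold: str,
               w_correct: float = 1.0, w_consistent: float = 0.5) -> float:
    """Combine a correctness term with a reasoning/label consistency term.

    The weights are illustrative assumptions, not values from the paper.
    """
    correct = 1.0 if predicted == gold else 0.0
    consistent = 1.0 if implied_polarity(reasoning) == predicted else 0.0
    return w_correct * correct + w_consistent * consistent
```

Under these assumptions, a prediction whose reasoning argues for a different polarity than the emitted label loses the consistency term even when the label happens to be correct, which is the kind of pressure a consistency-enforcing reward is meant to apply.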
