PERM: Psychology-grounded Empathetic Reward Modeling for Large Language Models

Chengbing Wang; Wuqiang Zheng; Yang Zhang; Fengbin Zhu; Junyi Cheng; Yi Xie; Wenjie Wang; Fuli Feng

arXiv:2601.10532·cs.CL·January 19, 2026

PERM: Psychology-grounded Empathetic Reward Modeling for Large Language Models

Chengbing Wang, Wuqiang Zheng, Yang Zhang, Fengbin Zhu, Junyi Cheng, Yi Xie, Wenjie Wang, Fuli Feng

PDF

Open Access

TL;DR

PERM introduces a psychology-grounded, bidirectional reward model for LLMs that improves empathetic responses by evaluating both supporter and seeker perspectives, outperforming existing methods in benchmarks and user preferences.

Contribution

This paper presents PERM, a novel bidirectional empathy evaluation framework grounded in psychology, enhancing LLMs' empathetic response generation beyond unidirectional models.

Findings

01

PERM outperforms state-of-the-art baselines by over 10% on benchmarks.

02

A blinded user study shows 70% preference for PERM-based responses.

03

PERM effectively incorporates supporter, seeker, and bystander perspectives.

Abstract

Large Language Models (LLMs) are increasingly deployed in human-centric applications, yet they often fail to provide substantive emotional support. While Reinforcement Learning (RL) has been utilized to enhance empathy of LLMs, existing reward models typically evaluate empathy from a single perspective, overlooking the inherently bidirectional interaction nature of empathy between the supporter and seeker as defined by Empathy Cycle theory. To address this limitation, we propose Psychology-grounded Empathetic Reward Modeling (PERM). PERM operationalizes empathy evaluation through a bidirectional decomposition: 1) Supporter perspective, assessing internal resonation and communicative expression; 2) Seeker perspective, evaluating emotional reception. Additionally, it incorporates a bystander perspective to monitor overall interaction quality. Extensive experiments on a widely-used…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmotion and Mood Recognition · Mental Health via Writing · Topic Modeling