CogDual: Enhancing Dual Cognition of LLMs via Reinforcement Learning with Implicit Rule-Based Rewards

Cheng Liu; Yifei Lu; Fanghua Ye; Jian Li; Xingyu Chen; Feiliang Ren; Zhaopeng Tu; Xiaolong Li

arXiv:2507.17147·cs.CL·July 24, 2025

CogDual: Enhancing Dual Cognition of LLMs via Reinforcement Learning with Implicit Rule-Based Rewards

Cheng Liu, Yifei Lu, Fanghua Ye, Jian Li, Xingyu Chen, Feiliang Ren, Zhaopeng Tu, Xiaolong Li

PDF

Open Access 1 Video

TL;DR

CogDual introduces a cognitively inspired framework for role-playing language agents that jointly models awareness and uses reinforcement learning to improve consistency and contextuality in open-domain text generation.

Contribution

It proposes a novel cognize-then-respond paradigm for RPLAs, integrating external and internal awareness, and employs reinforcement learning with reward schemes for better performance.

Findings

01

Outperforms existing baselines on CoSER, Cross-MR, and LifeChoice benchmarks.

02

Demonstrates improved character consistency and contextual alignment.

03

Shows effective generalization across diverse role-playing tasks.

Abstract

Role-Playing Language Agents (RPLAs) have emerged as a significant application direction for Large Language Models (LLMs). Existing approaches typically rely on prompt engineering or supervised fine-tuning to enable models to imitate character behaviors in specific scenarios, but often neglect the underlying \emph{cognitive} mechanisms driving these behaviors. Inspired by cognitive psychology, we introduce \textbf{CogDual}, a novel RPLA adopting a \textit{cognize-then-respond } reasoning paradigm. By jointly modeling external situational awareness and internal self-awareness, CogDual generates responses with improved character consistency and contextual alignment. To further optimize the performance, we employ reinforcement learning with two general-purpose reward schemes designed for open-domain text generation. Extensive experiments on the CoSER benchmark, as well as Cross-MR and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

CogDual: Enhancing Dual Cognition of LLMs via Reinforcement Learning with Implicit Rule-Based Rewards· underline

Taxonomy

TopicsData Stream Mining Techniques · Reinforcement Learning in Robotics · Fuzzy Logic and Control Systems