Rhea: Role-aware Heuristic Episodic Attention for Conversational LLMs

Wanyang Hong; Zhaoning Zhang; Yi Chen; Libo Zhang; Baihui Liu; Linbo Qiao; Zhiliang Tian; Dongsheng Li

arXiv:2512.06869·cs.CL·December 9, 2025

Rhea: Role-aware Heuristic Episodic Attention for Conversational LLMs

Wanyang Hong, Zhaoning Zhang, Yi Chen, Libo Zhang, Baihui Liu, Linbo Qiao, Zhiliang Tian, Dongsheng Li

PDF

Open Access

TL;DR

Rhea introduces a role-aware attention framework that effectively mitigates contextual decay in multi-turn conversational LLMs by separating global constraints and interaction memories, leading to improved accuracy and instruction fidelity.

Contribution

The paper proposes Rhea, a novel role-aware heuristic episodic attention framework that decouples conversation history into independent memory modules for better context management in LLMs.

Findings

01

Rhea improves multi-turn conversation accuracy by 16% over baselines.

02

Rhea maintains high instruction fidelity (IAR > 8.1) in long interactions.

03

Experiments on multiple benchmarks validate Rhea's effectiveness.

Abstract

Large Language Models (LLMs) have achieved remarkable performance on single-turn tasks, yet their effectiveness deteriorates in multi-turn conversations. We define this phenomenon as cumulative contextual decay - a progressive degradation of contextual integrity caused by attention pollution, dilution, and drift. To address this challenge, we propose Rhea (Role-aware Heuristic Episodic Attention), a novel framework that decouples conversation history into two functionally independent memory modules: (1) an Instructional Memory (IM) that persistently stores high-fidelity global constraints via a structural priority mechanism, and (2) an Episodic Memory (EM) that dynamically manages user-model interactions via asymmetric noise control and heuristic context retrieval. During inference, Rhea constructs a high signal-to-noise context by applying its priority attention: selectively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Machine Learning in Healthcare