How Emotion Shapes the Behavior of LLMs and Agents: A Mechanistic Study
Moran Sun, Tianlin Li, Yuwei Zheng, Zhenhong Zhou, Aishan Liu, Xianglong Liu, Yang Liu

TL;DR
This paper introduces E-STEER, a framework for embedding and controlling emotion in LLMs, revealing how emotion influences reasoning, safety, and multi-step behaviors.
Contribution
It presents a novel, interpretable method for directly manipulating emotion in LLMs and agents at the representation level.
Findings
Emotion affects LLM reasoning and generation in complex ways.
Specific emotions can improve safety and capabilities of LLMs.
Emotion systematically influences multi-step agent behaviors.
Abstract
Emotion plays an important role in human cognition and performance. Motivated by this, we investigate whether analogous emotional signals can shape the behavior of large language models (LLMs) and agents. Existing emotion-aware studies mainly treat emotion as a surface-level style factor or a perception target, overlooking its mechanistic role in task processing. To address this limitation, we propose E-STEER, an interpretable emotion steering framework that enables direct representation-level intervention in LLMs and agents. It embeds emotion as a structured, controllable variable in hidden states, and with it, we examine the impact of emotion on objective reasoning, subjective generation, safety, and multi-step agent behaviors. The results reveal non-monotonic emotion-behavior relations consistent with established psychological theories, and show that specific emotions not only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
