ActorMind: Emulating Human Actor Reasoning for Speech Role-Playing

Xi Chen; Wei Xue; Yike Guo

arXiv:2604.11103·cs.SD·April 27, 2026

ActorMind: Emulating Human Actor Reasoning for Speech Role-Playing

Xi Chen, Wei Xue, Yike Guo

PDF

TL;DR

This paper introduces ActorMind, a framework for speech role-playing that emulates human actor reasoning, supported by a hierarchical benchmark, ActorMindBench, to advance human-machine interaction and sociological research.

Contribution

It presents a novel multi-agent reasoning framework and a comprehensive benchmark for speech role-playing, addressing the gap in speech-based role-playing models.

Findings

01

ActorMind improves speech role-playing performance.

02

ActorMindBench provides extensive hierarchical data for evaluation.

03

The framework emulates human actor reasoning processes.

Abstract

Role-playing has garnered rising attention as it provides a strong foundation for human-machine interaction and facilitates sociological research. However, current work is confined to textual modalities, neglecting speech, which plays a predominant role in daily life, thus limiting genuine role-playing. To bridge this gap, we conceptualize and benchmark speech role-playing through ActorMindBench, and we present a corresponding reasoning framework, called ActorMind. Specifically, (1) Speech Role-Playing enables models to deliver spontaneous responses with personalized verbal traits based on their role, the scene, and spoken dialogue. (2) ActorMindBench is a hierarchical benchmark comprises Utterance-Level content with 7,653 utterances, Scene-Level content with 313 scenes, and Role-Level content with 6 roles. (3) ActorMind is an off-the-shelf, multi-agent, chain-of-though style reasoning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.