Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction
Haoqiu Yan, Yongxin Zhu, Kai Zheng, Bing Liu, Haoyu Cao, Deqiang Jiang, and Linli Xu

TL;DR
This paper introduces PerceptiveAgent, a multi-modal dialogue system that uses acoustic perception to understand speaker intentions better and generate more empathetic, contextually nuanced responses in human-AI conversations.
Contribution
It presents a novel system integrating speech modality perception with LLMs to improve empathetic understanding in dialogue systems, addressing limitations of text-only approaches.
Findings
PerceptiveAgent accurately interprets speaker intentions in complex scenarios.
The system produces more nuanced and expressive empathetic responses.
Experimental results show improved contextual understanding over baseline models.
Abstract
Large Language Model (LLM)-enhanced agents become increasingly prevalent in Human-AI communication, offering vast potential from entertainment to professional domains. However, current multi-modal dialogue systems overlook the acoustic information present in speech, which is crucial for understanding human communication nuances. This oversight can lead to misinterpretations of speakers' intentions, resulting in inconsistent or even contradictory responses within dialogues. To bridge this gap, in this paper, we propose PerceptiveAgent, an empathetic multi-modal dialogue system designed to discern deeper or more subtle meanings beyond the literal interpretations of words through the integration of speech modality perception. Employing LLMs as a cognitive core, PerceptiveAgent perceives acoustic information from input speech and generates empathetic responses based on speaking styles…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage, Metaphor, and Cognition · Language, Discourse, Communication Strategies · Speech and dialogue systems
