Empathy Through Multimodality in Conversational Interfaces
Mahyar Abbasian, Iman Azimi, Mohammad Feli, Amir M. Rahmani, Ramesh Jain

TL;DR
This paper presents a multimodal conversational health agent that interprets emotional cues to deliver empathetic responses, enhancing mental health support through advanced AI and emotional intelligence integration.
Contribution
It introduces an LLM-based multimodal health agent capable of emotional understanding and empathetic responses, utilizing the openCHA framework for improved digital health interactions.
Findings
High concordance between CHA responses and human evaluators' assessments
Vocal emotion recognition significantly enhances empathetic connection
Multimodal cues improve the consistency of emotional interpretation
Abstract
Agents represent one of the most emerging applications of Large Language Models (LLMs) and Generative AI, with their effectiveness hinging on multimodal capabilities to navigate complex user environments. Conversational Health Agents (CHAs), a prime example of this, are redefining healthcare by offering nuanced support that transcends textual analysis to incorporate emotional intelligence. This paper introduces an LLM-based CHA engineered for rich, multimodal dialogue, especially in the realm of mental health support. It adeptly interprets and responds to users' emotional states by analyzing multimodal cues, thus delivering contextually aware and empathetically resonant verbal responses. Our implementation leverages the versatile openCHA framework, and our comprehensive evaluation involves neutral prompts expressed in diverse emotional tones: sadness, anger, and joy. We evaluate the…
Taxonomy
Topics: Language, Metaphor, and Cognition · EFL/ESL Teaching and Learning · Digital Storytelling and Education
Methods: Attentive Walk-Aggregating Graph Neural Network
