Learning to Generate Context-Sensitive Backchannel Smiles for Embodied AI Agents with Applications in Mental Health Dialogues
Maneesh Bilalpur, Mert Inan, Dorsa Zeinali, Jeffrey F. Cohn, Malihe, Alikhani

TL;DR
This paper develops a generative model for context-sensitive backchannel smiles in embodied AI agents, enhancing their ability to build rapport in mental health dialogue scenarios, with improved perception and interaction quality.
Contribution
It introduces a novel attention-based generation approach for backchannel smiles conditioned on speech and demographic cues, advancing non-verbal behavior synthesis in embodied agents.
Findings
Listener information improves smile generation performance.
Conditioned generation yields statistically significant quality improvements.
Generated smiles increase perceived human-likeness of agents.
Abstract
Addressing the critical shortage of mental health resources for effective screening, diagnosis, and treatment remains a significant challenge. This scarcity underscores the need for innovative solutions, particularly in enhancing the accessibility and efficacy of therapeutic support. Embodied agents with advanced interactive capabilities emerge as a promising and cost-effective supplement to traditional caregiving methods. Crucial to these agents' effectiveness is their ability to simulate non-verbal behaviors, like backchannels, that are pivotal in establishing rapport and understanding in therapeutic contexts but remain under-explored. To improve the rapport-building capabilities of embodied agents we annotated backchannel smiles in videos of intimate face-to-face conversations over topics such as mental health, illness, and relationships. We hypothesized that both speaker and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Neuroscience and Music Perception · Social Robot Interaction and HRI
