Establishing Human-Robot Trust through Music-Driven Robotic Emotion Prosody and Gesture
Richard Savery, Ryan Rose, Gil Weinberg

TL;DR
This paper introduces a music-driven emotional prosody and gesture model for robots to foster trust, demonstrating improved emotional expression and higher trust levels compared to traditional speech systems.
Contribution
The novel model integrates music-driven emotional cues with gestures to enhance human-robot trust, avoiding the uncanny valley and improving emotional perception.
Findings
System accurately portrayed a range of emotions.
Achieved 8% higher trust than state-of-the-art TTS.
User study confirmed effectiveness in emotional communication.
Abstract
As human-robot collaboration opportunities continue to expand, trust becomes ever more important for full engagement and utilization of robots. Affective trust, built on emotional relationship and interpersonal bonds is particularly critical as it is more resilient to mistakes and increases the willingness to collaborate. In this paper we present a novel model built on music-driven emotional prosody and gestures that encourages the perception of a robotic identity, designed to avoid uncanny valley. Symbolic musical phrases were generated and tagged with emotional information by human musicians. These phrases controlled a synthesis engine playing back pre-rendered audio samples generated through interpolation of phonemes and electronic instruments. Gestures were also driven by the symbolic phrases, encoding the emotion from the musical phrase to low degree-of-freedom movements. Through a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Emotion and Mood Recognition · Action Observation and Synchronization
