Robots Learn Social Skills: End-to-End Learning of Co-Speech Gesture   Generation for Humanoid Robots

Youngwoo Yoon; Woo-Ri Ko; Minsu Jang; Jaeyeon Lee; Jaehong Kim,; Geehyuk Lee

arXiv:1810.12541·cs.RO·October 31, 2018

Robots Learn Social Skills: End-to-End Learning of Co-Speech Gesture Generation for Humanoid Robots

Youngwoo Yoon, Woo-Ri Ko, Minsu Jang, Jaeyeon Lee, Jaehong Kim,, Geehyuk Lee

PDF

5 Repos

TL;DR

This paper introduces an end-to-end neural network model that learns to generate human-like co-speech gestures from speech data, improving robot-human interaction without relying on rule-based systems.

Contribution

It presents a novel learning-based approach for co-speech gesture generation from speech data, trained on 52 hours of TED talks, enabling real-time, diverse gesture production.

Findings

01

Generated gestures were perceived as human-like and contextually appropriate.

02

The model successfully produced various gesture types including iconic, metaphoric, deictic, and beat gestures.

03

Real-time gesture generation demonstrated on a NAO robot.

Abstract

Co-speech gestures enhance interaction experiences between humans as well as between humans and robots. Existing robots use rule-based speech-gesture association, but this requires human labor and prior knowledge of experts to be implemented. We present a learning-based co-speech gesture generation that is learned from 52 h of TED talks. The proposed end-to-end neural network model consists of an encoder for speech text understanding and a decoder to generate a sequence of gestures. The model successfully produces various gestures including iconic, metaphoric, deictic, and beat gestures. In a subjective evaluation, participants reported that the gestures were human-like and matched the speech content. We also demonstrate a co-speech gesture with a NAO robot working in real time.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.