Training an Interactive Humanoid Robot Using Multimodal Deep   Reinforcement Learning

Heriberto Cuay\'ahuitl; Guillaume Couly; Cl\'ement Olalainty

arXiv:1611.08666·cs.LG·November 29, 2016·2 cites

Training an Interactive Humanoid Robot Using Multimodal Deep Reinforcement Learning

Heriberto Cuay\'ahuitl, Guillaume Couly, Cl\'ement Olalainty

PDF

Open Access 1 Repo

TL;DR

This paper presents a multimodal deep reinforcement learning approach to train a humanoid robot to perceive, interact, and play the game of noughts and crosses, demonstrating high success rates and fluent interactions.

Contribution

It introduces a novel multimodal deep reinforcement learning method enabling a humanoid robot to perceive and interact in a game environment with high accuracy and natural communication.

Findings

01

Robot learns to win or draw 98% of games in simulation.

02

The system achieves fluent verbal and non-verbal interactions.

03

Experimental results validate the effectiveness of the multimodal approach.

Abstract

Training robots to perceive, act and communicate using multiple modalities still represents a challenging problem, particularly if robots are expected to learn efficiently from small sets of example interactions. We describe a learning approach as a step in this direction, where we teach a humanoid robot how to play the game of noughts and crosses. Given that multiple multimodal skills can be trained to play this game, we focus our attention to training the robot to perceive the game, and to interact in this game. Our multimodal deep reinforcement learning agent perceives multimodal features and exhibits verbal and non-verbal actions while playing. Experimental results using simulations show that the robot can learn to win or draw up to 98% of the games. A pilot test of the proposed multimodal system for the targeted game---integrating speech, vision and gestures---reports that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cuayahuitl/SimpleDS
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Social Robot Interaction and HRI