A Data-Efficient Deep Learning Approach for Deployable Multimodal Social   Robots

Heriberto Cuay\'ahuitl

arXiv:1908.10398·cs.AI·August 29, 2019

A Data-Efficient Deep Learning Approach for Deployable Multimodal Social Robots

Heriberto Cuay\'ahuitl

PDF

TL;DR

This paper presents a data-efficient deep learning approach enabling social robots to learn multimodal games like Noughts & Crosses with minimal data, robust perception, and competitive strategies, validated through human and automatic evaluations.

Contribution

It introduces novel algorithms for visual tracking and policy learning, demonstrating effective, data-efficient training of humanoid robots for multimodal game interaction.

Findings

01

High winning rates in game play surpassing DQN baselines

02

Robust visual perception is crucial for successful interaction

03

Approach can be extended to other games with similar success

Abstract

The deep supervised and reinforcement learning paradigms (among others) have the potential to endow interactive multimodal social robots with the ability of acquiring skills autonomously. But it is still not very clear yet how they can be best deployed in real world applications. As a step in this direction, we propose a deep learning-based approach for efficiently training a humanoid robot to play multimodal games---and use the game of `Noughts & Crosses' with two variants as a case study. Its minimum requirements for learning to perceive and interact are based on a few hundred example images, a few example multimodal dialogues and physical demonstrations of robot manipulation, and automatic simulations. In addition, we propose novel algorithms for robust visual game tracking and for competitive policy learning with high winning rates, which substantially outperform DQN-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.