Embodying Pre-Trained Word Embeddings Through Robot Actions

Minori Toyoda; Kanata Suzuki; Hiroki Mori; Yoshihiko Hayashi; Tetsuya; Ogata

arXiv:2104.08521·cs.RO·April 20, 2021

Embodying Pre-Trained Word Embeddings Through Robot Actions

Minori Toyoda, Kanata Suzuki, Hiroki Mori, Yoshihiko Hayashi, Tetsuya, Ogata

PDF

TL;DR

This paper introduces a neural network model that grounds pre-trained word embeddings in robot sensory-motor experiences, enabling robots to better understand and generate actions from linguistic descriptions, including unseen words.

Contribution

It presents a method to retrofit pre-trained word embeddings with robot experiences, improving grounded language understanding in robots.

Findings

01

Embeddings of synonyms form semantic clusters based on robot experiences.

02

The model enables robots to generate actions from unseen words.

03

Grounded embeddings improve robot's linguistic and action comprehension.

Abstract

We propose a promising neural network model with which to acquire a grounded representation of robot actions and the linguistic descriptions thereof. Properly responding to various linguistic expressions, including polysemous words, is an important ability for robots that interact with people via linguistic dialogue. Previous studies have shown that robots can use words that are not included in the action-description paired datasets by using pre-trained word embeddings. However, the word embeddings trained under the distributional hypothesis are not grounded, as they are derived purely from a text corpus. In this letter, we transform the pre-trained word embeddings to embodied ones by using the robot's sensory-motor experiences. We extend a bidirectional translation model for actions and descriptions by incorporating non-linear layers that retrofit the word embeddings. By training the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.