Translating Natural Language Instructions for Behavioral Robot   Navigation with a Multi-Head Attention Mechanism

Patricio Cerda-Mardini; Vladimir Araujo; Alvaro Soto

arXiv:2006.00697·cs.LG·June 9, 2020

Translating Natural Language Instructions for Behavioral Robot Navigation with a Multi-Head Attention Mechanism

Patricio Cerda-Mardini, Vladimir Araujo, Alvaro Soto

PDF

TL;DR

This paper introduces a multi-head attention mechanism in a neural network to improve the translation of natural language instructions into behavioral commands for indoor robot navigation, enhancing generalization to unseen environments.

Contribution

It presents a novel multi-head attention blending layer that significantly improves instruction translation accuracy in unseen environments for robot navigation.

Findings

01

Enhanced translation accuracy in new environments

02

Significant performance improvements over previous models

03

Better generalization capabilities

Abstract

We propose a multi-head attention mechanism as a blending layer in a neural network model that translates natural language to a high level behavioral language for indoor robot navigation. We follow the framework established by (Zang et al., 2018a) that proposes the use of a navigation graph as a knowledge base for the task. Our results show significant performance gains when translating instructions on previously unseen environments, therefore, improving the generalization capabilities of the model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Softmax · Linear Layer · Multi-Head Attention