Translating Natural Language Instructions for Behavioral Robot Navigation with a Multi-Head Attention Mechanism
Patricio Cerda-Mardini, Vladimir Araujo, Alvaro Soto

TL;DR
This paper introduces a multi-head attention mechanism in a neural network to improve the translation of natural language instructions into behavioral commands for indoor robot navigation, enhancing generalization to unseen environments.
Contribution
It presents a novel multi-head attention blending layer that significantly improves instruction translation accuracy in unseen environments for robot navigation.
Findings
Enhanced translation accuracy in new environments
Significant performance improvements over previous models
Better generalization capabilities
Abstract
We propose a multi-head attention mechanism as a blending layer in a neural network model that translates natural language to a high level behavioral language for indoor robot navigation. We follow the framework established by (Zang et al., 2018a) that proposes the use of a navigation graph as a knowledge base for the task. Our results show significant performance gains when translating instructions on previously unseen environments, therefore, improving the generalization capabilities of the model.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Softmax · Linear Layer · Multi-Head Attention
