Towards self-attention based visual navigation in the real world

Jaime Ruiz-Serra; Jack White; Stephen Petrie; Tatiana Kameneva; Chris McCarthy

arXiv:2209.07043·cs.RO·September 20, 2022

TL;DR

This paper explores the use of self-attention mechanisms in visual navigation, demonstrating their ability to bridge the reality gap and enable real-time processing of real-world images with minimal parameters.

Contribution

The paper systematically explores the hyperparameter space for self-attention based navigation of 3D environments, proposes strategies to improve generalization, and shows that the resulting agents can process real-world images in real time.

Findings

01. Self-attention based agents can generalize across environments.

02. Models trained in simulation effectively process real-world images.

03. Navigation performance improves with hyperparameter tuning.

Abstract

Vision guided navigation requires processing complex visual information to inform task-orientated decisions. Applications include autonomous robots, self-driving cars, and assistive vision for humans. A key element is the extraction and selection of relevant features in pixel space upon which to base action choices, for which Machine Learning techniques are well suited. However, Deep Reinforcement Learning agents trained in simulation often exhibit unsatisfactory results when deployed in the real world due to perceptual differences known as the reality gap. An approach that is yet to be explored to bridge this gap is self-attention. In this paper we (1) perform a systematic exploration of the hyperparameter space for self-attention based navigation of 3D environments and qualitatively appraise behaviour observed from different hyperparameter sets, including their ability to…

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning

Methods: Balanced Selection