Towards self-attention based visual navigation in the real world
Jaime Ruiz-Serra, Jack White, Stephen Petrie, Tatiana Kameneva, Chris, McCarthy

TL;DR
This paper explores the use of self-attention mechanisms in visual navigation, demonstrating their ability to bridge the reality gap and enable real-time processing of real-world images with minimal parameters.
Contribution
It systematically investigates hyperparameters for self-attention in 3D navigation, proposes strategies to enhance generalization, and shows real-time real-world image processing capabilities.
Findings
Self-attention based agents can generalize across environments.
Models trained in simulation effectively process real-world images.
Navigation performance improves with hyperparameter tuning.
Abstract
Vision guided navigation requires processing complex visual information to inform task-orientated decisions. Applications include autonomous robots, self-driving cars, and assistive vision for humans. A key element is the extraction and selection of relevant features in pixel space upon which to base action choices, for which Machine Learning techniques are well suited. However, Deep Reinforcement Learning agents trained in simulation often exhibit unsatisfactory results when deployed in the real-world due to perceptual differences known as the . An approach that is yet to be explored to bridge this gap is self-attention. In this paper we (1) perform a systematic exploration of the hyperparameter space for self-attention based navigation of 3D environments and qualitatively appraise behaviour observed from different hyperparameter sets, including their ability to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning
MethodsBalanced Selection
