Goal-Guided Transformer-Enabled Reinforcement Learning for Efficient Autonomous Navigation
Wenhui Huang, Yanxin Zhou, Xiangkun He, and Chen Lv

TL;DR
This paper introduces a goal-guided Transformer-based reinforcement learning approach that significantly improves data efficiency and navigation performance in autonomous systems by integrating goal information into scene perception.
Contribution
The paper proposes a novel Goal-guided Transformer (GoT) architecture pre-trained with expert priors, enhancing data efficiency and goal relevance in DRL-based autonomous navigation.
Findings
Enhanced data efficiency in navigation tasks
Superior performance in simulation and real-world tests
Improved robustness and generalization to real-world scenarios
Abstract
Despite some successful applications of goal-driven navigation, existing deep reinforcement learning (DRL)-based approaches notoriously suffers from poor data efficiency issue. One of the reasons is that the goal information is decoupled from the perception module and directly introduced as a condition of decision-making, resulting in the goal-irrelevant features of the scene representation playing an adversary role during the learning process. In light of this, we present a novel Goal-guided Transformer-enabled reinforcement learning (GTRL) approach by considering the physical goal states as an input of the scene encoder for guiding the scene representation to couple with the goal information and realizing efficient autonomous navigation. More specifically, we propose a novel variant of the Vision Transformer as the backbone of the perception system, namely Goal-guided Transformer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robotics and Sensor-Based Localization · Domain Adaptation and Few-Shot Learning
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Softmax · Adam · Byte Pair Encoding · Residual Connection · Label Smoothing · Dropout
