Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning
Hao Zhang, Hao Wang, and Zhen Kan

TL;DR
This paper introduces T2TL, a novel framework that integrates Transformer models into reinforcement learning for interpretable temporal logic motion planning, improving task understanding and learning efficiency in complex robotic tasks.
Contribution
The paper proposes a Double-Transformer-guided Temporal Logic framework (T2TL) that enhances reinforcement learning with structured LTL instruction encoding and environment-agnostic pre-training.
Findings
T2TL effectively encodes LTL instructions for better task understanding.
Decomposition of tasks into sub-goals improves learning efficiency.
Simulation results validate the effectiveness of the proposed framework.
Abstract
Automaton based approaches have enabled robots to perform various complex tasks. However, most existing automaton based algorithms highly rely on the manually customized representation of states for the considered task, limiting its applicability in deep reinforcement learning algorithms. To address this issue, by incorporating Transformer into reinforcement learning, we develop a Double-Transformer-guided Temporal Logic framework (T2TL) that exploits the structural feature of Transformer twice, i.e., first encoding the LTL instruction via the Transformer module for efficient understanding of task instructions during the training and then encoding the context variable via the Transformer again for improved task performance. Particularly, the LTL instruction is specified by co-safe LTL. As a semantics-preserving rewriting operation, LTL progression is exploited to decompose the complex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Human Pose and Action Recognition · Adversarial Robustness in Machine Learning
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Softmax · Dropout · Dense Connections · Residual Connection · Absolute Position Encodings · Position-Wise Feed-Forward Layer
