Exploiting Transformer in Sparse Reward Reinforcement Learning for   Interpretable Temporal Logic Motion Planning

Hao Zhang; Hao Wang; and Zhen Kan

arXiv:2209.13220·cs.RO·July 18, 2023

Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning

Hao Zhang, Hao Wang, and Zhen Kan

PDF

Open Access 1 Repo

TL;DR

This paper introduces T2TL, a novel framework that integrates Transformer models into reinforcement learning for interpretable temporal logic motion planning, improving task understanding and learning efficiency in complex robotic tasks.

Contribution

The paper proposes a Double-Transformer-guided Temporal Logic framework (T2TL) that enhances reinforcement learning with structured LTL instruction encoding and environment-agnostic pre-training.

Findings

01

T2TL effectively encodes LTL instructions for better task understanding.

02

Decomposition of tasks into sub-goals improves learning efficiency.

03

Simulation results validate the effectiveness of the proposed framework.

Abstract

Automaton based approaches have enabled robots to perform various complex tasks. However, most existing automaton based algorithms highly rely on the manually customized representation of states for the considered task, limiting its applicability in deep reinforcement learning algorithms. To address this issue, by incorporating Transformer into reinforcement learning, we develop a Double-Transformer-guided Temporal Logic framework (T2TL) that exploits the structural feature of Transformer twice, i.e., first encoding the LTL instruction via the Transformer module for efficient understanding of task instructions during the training and then encoding the context variable via the Transformer again for improved task performance. Particularly, the LTL instruction is specified by co-safe LTL. As a semantics-preserving rewriting operation, LTL progression is exploited to decompose the complex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

charlie0257/t2tl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Human Pose and Action Recognition · Adversarial Robustness in Machine Learning

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Softmax · Dropout · Dense Connections · Residual Connection · Absolute Position Encodings · Position-Wise Feed-Forward Layer