Control Transformer: Robot Navigation in Unknown Environments through   PRM-Guided Return-Conditioned Sequence Modeling

Daniel Lawson; Ahmed H. Qureshi

arXiv:2211.06407·cs.RO·July 14, 2023

Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling

Daniel Lawson, Ahmed H. Qureshi

PDF

Open Access

TL;DR

This paper introduces Control Transformer, a novel approach combining sequence modeling and sampling-based planning to enable robots to navigate long-horizon, partially-observed environments and transfer to real-world scenarios without additional training.

Contribution

The paper presents a new framework that integrates PRM-guided sequence modeling for long-horizon robot navigation in unknown environments, demonstrating effective sim2real transfer.

Findings

01

Successfully navigates mazes with MuJoCo robots

02

Transfers to real Turtlebot3 robot in zero-shot setting

03

Outperforms baseline methods in long-horizon tasks

Abstract

Learning long-horizon tasks such as navigation has presented difficult challenges for successfully applying reinforcement learning to robotics. From another perspective, under known environments, sampling-based planning can robustly find collision-free paths in environments without learning. In this work, we propose Control Transformer that models return-conditioned sequences from low-level policies guided by a sampling-based Probabilistic Roadmap (PRM) planner. We demonstrate that our framework can solve long-horizon navigation tasks using only local information. We evaluate our approach on partially-observed maze navigation with MuJoCo robots, including Ant, Point, and Humanoid. We show that Control Transformer can successfully navigate through mazes and transfer to unknown environments. Additionally, we apply our method to a differential drive robot (Turtlebot3) and show zero-shot…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Robotic Locomotion and Control

MethodsMulti-Head Attention · Attention Is All You Need · Position-Wise Feed-Forward Layer · Label Smoothing · Layer Normalization · Linear Layer · Softmax · Adam · Absolute Position Encodings · Byte Pair Encoding