Sequence-Based Plan Feasibility Prediction for Efficient Task and Motion Planning
Zhutian Yang, Caelan Reed Garrett, Tom\'as Lozano-P\'erez, Leslie, Kaelbling, Dieter Fox

TL;DR
This paper introduces PIGINet, a Transformer-based plan feasibility predictor integrated into a TAMP algorithm, significantly improving planning efficiency and enabling zero-shot generalization in complex mobile manipulation tasks.
Contribution
The paper presents a novel Transformer-based learning method, PIGINet, that predicts plan feasibility to bias and accelerate task and motion planning in complex environments.
Findings
PIGINet reduces planning runtime by up to 80%.
The method generalizes to unseen object categories without additional training.
Training on 150-600 problems suffices for substantial efficiency gains.
Abstract
We present a learning-enabled Task and Motion Planning (TAMP) algorithm for solving mobile manipulation problems in environments with many articulated and movable obstacles. Our idea is to bias the search procedure of a traditional TAMP planner with a learned plan feasibility predictor. The core of our algorithm is PIGINet, a novel Transformer-based learning method that takes in a task plan, the goal, and the initial state, and predicts the probability of finding motion trajectories associated with the task plan. We integrate PIGINet within a TAMP planner that generates a diverse set of high-level task plans, sorts them by their predicted likelihood of feasibility, and refines them in that order. We evaluate the runtime of our TAMP algorithm on seven families of kitchen rearrangement problems, comparing its performance to that of non-learning baselines. Our experiments show that PIGINet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Multimodal Machine Learning Applications · Human Pose and Action Recognition
