UniPlan: Vision-Language Task Planning for Mobile Manipulation with Unified PDDL Formulation
Haoming Ye, Yunxiao Xiao, Cewu Lu, Panpan Cai

TL;DR
UniPlan is a comprehensive vision-language planning system that extends symbolic manipulation to large-scale indoor environments, enabling mobile robot tasks with improved success rates and efficiency.
Contribution
It introduces a unified PDDL framework for long-horizon mobile manipulation, integrating scene topology, visuals, and robot capabilities for complex indoor tasks.
Findings
Outperforms existing methods in success rate and plan quality.
Operates efficiently on large-scale real-world maps.
Successfully handles navigation, door traversal, and bimanual tasks.
Abstract
Integration of VLM reasoning with symbolic planning has proven to be a promising approach to real-world robot task planning. Existing work like UniDomain effectively learns symbolic manipulation domains from real-world demonstrations, described in Planning Domain Definition Language (PDDL), and has successfully applied them to real-world tasks. These domains, however, are restricted to tabletop manipulation. We propose UniPlan, a vision-language task planning system for long-horizon mobile-manipulation in large-scale indoor environments, that unifies scene topology, visuals, and robot capabilities into a holistic PDDL representation. UniPlan programmatically extends learned tabletop domains from UniDomain to support navigation, door traversal, and bimanual coordination. It operates on a visual-topological map, comprising navigation landmarks anchored with scene images. Given a language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Multimodal Machine Learning Applications · Robotics and Sensor-Based Localization
