UniManip: General-Purpose Zero-Shot Robotic Manipulation with Agentic Operational Graph
Haichao Liu, Yuanjiang Xue, Yuheng Zhou, Haoyuan Deng, Yinan Liang, Lihua Xie, Ziwei Wang

TL;DR
UniManip introduces a unified framework combining semantic reasoning and physical grounding through an agentic operational graph, enabling robust zero-shot robotic manipulation in unstructured environments with high success rates.
Contribution
The paper presents UniManip, a novel bi-level agentic operational graph framework that unifies semantic reasoning and physical grounding for zero-shot robotic manipulation.
Findings
Achieves 22.5% higher success rate than state-of-the-art VLA models.
Demonstrates 25.0% higher success rate than hierarchical baselines.
Enables zero-shot transfer from fixed-base to mobile manipulation without reconfiguration.
Abstract
Achieving general-purpose robotic manipulation requires robots to seamlessly bridge high-level semantic intent with low-level physical interaction in unstructured environments. However, existing approaches falter in zero-shot generalization: end-to-end Vision-Language-Action (VLA) models often lack the precision required for long-horizon tasks, while traditional hierarchical planners suffer from semantic rigidity when facing open-world variations. To address this, we present UniManip, a framework grounded in a Bi-level Agentic Operational Graph (AOG) that unifies semantic reasoning and physical grounding. By coupling a high-level Agentic Layer for task orchestration with a low-level Scene Layer for dynamic state representation, the system continuously aligns abstract planning with geometric constraints, enabling robust zero-shot execution. Unlike static pipelines, UniManip operates as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robot Manipulation and Learning · Reinforcement Learning in Robotics
