UniManip: General-Purpose Zero-Shot Robotic Manipulation with Agentic Operational Graph

Haichao Liu; Yuanjiang Xue; Yuheng Zhou; Haoyuan Deng; Yinan Liang; Lihua Xie; Ziwei Wang

arXiv:2602.13086·cs.RO·February 16, 2026

UniManip: General-Purpose Zero-Shot Robotic Manipulation with Agentic Operational Graph

Haichao Liu, Yuanjiang Xue, Yuheng Zhou, Haoyuan Deng, Yinan Liang, Lihua Xie, Ziwei Wang

PDF

Open Access

TL;DR

UniManip introduces a unified framework combining semantic reasoning and physical grounding through an agentic operational graph, enabling robust zero-shot robotic manipulation in unstructured environments with high success rates.

Contribution

The paper presents UniManip, a novel bi-level agentic operational graph framework that unifies semantic reasoning and physical grounding for zero-shot robotic manipulation.

Findings

01

Achieves 22.5% higher success rate than state-of-the-art VLA models.

02

Demonstrates 25.0% higher success rate than hierarchical baselines.

03

Enables zero-shot transfer from fixed-base to mobile manipulation without reconfiguration.

Abstract

Achieving general-purpose robotic manipulation requires robots to seamlessly bridge high-level semantic intent with low-level physical interaction in unstructured environments. However, existing approaches falter in zero-shot generalization: end-to-end Vision-Language-Action (VLA) models often lack the precision required for long-horizon tasks, while traditional hierarchical planners suffer from semantic rigidity when facing open-world variations. To address this, we present UniManip, a framework grounded in a Bi-level Agentic Operational Graph (AOG) that unifies semantic reasoning and physical grounding. By coupling a high-level Agentic Layer for task orchestration with a low-level Scene Layer for dynamic state representation, the system continuously aligns abstract planning with geometric constraints, enabling robust zero-shot execution. Unlike static pipelines, UniManip operates as a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Robot Manipulation and Learning · Reinforcement Learning in Robotics