DockAnywhere: Data-Efficient Visuomotor Policy Learning for Mobile Manipulation via Novel Demonstration Generation

Ziyu Shan; Yuheng Zhou; Gaoyuan Wu; Ziheng Ji; Zhenyu Wu; Ziwei Wang

arXiv:2604.15023·cs.RO·April 17, 2026

DockAnywhere: Data-Efficient Visuomotor Policy Learning for Mobile Manipulation via Novel Demonstration Generation

Ziyu Shan, Yuheng Zhou, Gaoyuan Wu, Ziheng Ji, Zhenyu Wu, Ziwei Wang

PDF

TL;DR

DockAnywhere is a novel framework that enhances viewpoint generalization in mobile manipulation by generating diverse feasible docking demonstrations from a single trajectory, improving real-world policy success.

Contribution

It introduces a low-cost demonstration generation method that decouples docking and manipulation, enabling better generalization across different viewpoints and docking points.

Findings

01

Significantly improves policy success rates in experiments.

02

Enhances generalization to unseen viewpoints and docking configurations.

03

Demonstrates effectiveness on ManiSkill and real-world platforms.

Abstract

Mobile manipulation is a fundamental capability that enables robots to interact in expansive environments such as homes and factories. Most existing approaches follow a two-stage paradigm, where the robot first navigates to a docking point and then performs fixed-base manipulation using powerful visuomotor policies. However, real-world mobile manipulation often suffers from the view generalization problem due to shifts of docking points. To address this issue, we propose a novel low-cost demonstration generation framework named DockAnywhere, which improves viewpoint generalization under docking variability by lifting a single demonstration to diverse feasible docking configurations. Specifically, DockAnywhere lifts a trajectory to any feasible docking points by decoupling docking-dependent base motions from contact-rich manipulation skills that remain invariant across viewpoints.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.