AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation
Ziyan Zhao, Ke Fan, He-Yang Xu, Ning Qiao, Bo Peng, Wenlong Gao, Dongjiang Li, Hui Shen

TL;DR
AnchorDP3 introduces a novel diffusion policy framework for robotic manipulation that leverages semantic segmentation, task-conditioned encoding, and affordance-anchored keyposes to achieve high success rates in highly randomized environments.
Contribution
The paper presents a new diffusion policy approach integrating affordance-guided keyposes and multi-task learning for robust robotic manipulation without human demonstrations.
Findings
Achieves 98.7% success rate on RoboTwin benchmark.
Effectively handles extreme environmental randomization.
Demonstrates potential for autonomous visuomotor policy generation.
Abstract
We present AnchorDP3, a diffusion policy framework for dual-arm robotic manipulation that achieves state-of-the-art performance in highly randomized environments. AnchorDP3 integrates three key innovations: (1) Simulator-Supervised Semantic Segmentation, using rendered ground truth to explicitly segment task-critical objects within the point cloud, which provides strong affordance priors; (2) Task-Conditioned Feature Encoders, lightweight modules processing augmented point clouds per task, enabling efficient multi-task learning through a shared diffusion-based action expert; (3) Affordance-Anchored Keypose Diffusion with Full State Supervision, replacing dense trajectory prediction with sparse, geometrically meaningful action anchors, i.e., keyposes such as pre-grasp pose, grasp pose directly anchored to affordances, drastically simplifying the prediction space; the action expert is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
