AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation

Ziyan Zhao; Ke Fan; He-Yang Xu; Ning Qiao; Bo Peng; Wenlong Gao; Dongjiang Li; Hui Shen

arXiv:2506.19269·cs.RO·June 26, 2025

AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation

Ziyan Zhao, Ke Fan, He-Yang Xu, Ning Qiao, Bo Peng, Wenlong Gao, Dongjiang Li, Hui Shen

PDF

TL;DR

AnchorDP3 introduces a novel diffusion policy framework for robotic manipulation that leverages semantic segmentation, task-conditioned encoding, and affordance-anchored keyposes to achieve high success rates in highly randomized environments.

Contribution

The paper presents a new diffusion policy approach integrating affordance-guided keyposes and multi-task learning for robust robotic manipulation without human demonstrations.

Findings

01

Achieves 98.7% success rate on RoboTwin benchmark.

02

Effectively handles extreme environmental randomization.

03

Demonstrates potential for autonomous visuomotor policy generation.

Abstract

We present AnchorDP3, a diffusion policy framework for dual-arm robotic manipulation that achieves state-of-the-art performance in highly randomized environments. AnchorDP3 integrates three key innovations: (1) Simulator-Supervised Semantic Segmentation, using rendered ground truth to explicitly segment task-critical objects within the point cloud, which provides strong affordance priors; (2) Task-Conditioned Feature Encoders, lightweight modules processing augmented point clouds per task, enabling efficient multi-task learning through a shared diffusion-based action expert; (3) Affordance-Anchored Keypose Diffusion with Full State Supervision, replacing dense trajectory prediction with sparse, geometrically meaningful action anchors, i.e., keyposes such as pre-grasp pose, grasp pose directly anchored to affordances, drastically simplifying the prediction space; the action expert is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDiffusion