RoboPCA: Pose-centered Affordance Learning from Human Demonstrations for Robot Manipulation

Zhanqi Xiao; Ruiping Wang; Xilin Chen

arXiv:2603.07691·cs.RO·March 10, 2026

RoboPCA: Pose-centered Affordance Learning from Human Demonstrations for Robot Manipulation

Zhanqi Xiao, Ruiping Wang, Xilin Chen

PDF

Open Access

TL;DR

RoboPCA introduces a joint prediction framework for contact regions and poses in robot manipulation, utilizing a novel data collection pipeline from human demonstrations to improve task performance and generalization.

Contribution

The paper presents RoboPCA, a pose-centered affordance prediction method with a new data curation pipeline, Human2Afford, enabling scalable learning from human demonstrations.

Findings

01

Outperforms baseline methods on image datasets, simulation, and real robots.

02

Demonstrates strong generalization across tasks and object categories.

03

Effectively integrates geometry-appearance cues and mask features in a diffusion framework.

Abstract

Understanding spatial affordances -- comprising the contact regions of object interaction and the corresponding contact poses -- is essential for robots to effectively manipulate objects and accomplish diverse tasks. However, existing spatial affordance prediction methods mainly focus on locating the contact regions while delegating the pose to independent pose estimation approaches, which can lead to task failures due to inconsistencies between predicted contact regions and candidate poses. In this work, we propose RoboPCA, a pose-centered affordance prediction framework that jointly predicts task-appropriate contact regions and poses conditioned on instructions. To enable scalable data collection for pose-centered affordance learning, we devise Human2Afford, a data curation pipeline that automatically recovers scene-level 3D information and infers pose-centered affordance annotations…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Social Robot Interaction and HRI