Multi-Keypoint Affordance Representation for Functional Dexterous Grasping
Fan Yang, Dongsheng Luo, Wenrui Chen, Jiacheng Lin, Junjie Cai, Kailun Yang, Zhiyong Li, Yaonan Wang

TL;DR
This paper introduces a novel multi-keypoint affordance representation for functional dexterous grasping, directly encoding task-specific grasp configurations to improve manipulation accuracy and generalization in robotic systems.
Contribution
It proposes Contact-guided Multi-Keypoint Affordance (CMKA) and Keypoint-based Grasp matrix Transformation (KGT), enabling precise, generalizable, and direct visual-to-manipulation mapping without manual keypoint annotations.
Findings
Enhanced affordance localization accuracy
Improved grasp consistency and robustness
Better generalization to unseen tools and tasks
Abstract
Functional dexterous grasping requires precise hand-object interaction, going beyond simple gripping. Existing affordance-based methods primarily predict coarse interaction regions and cannot directly constrain the grasping posture, leading to a disconnection between visual perception and manipulation. To address this issue, we propose a multi-keypoint affordance representation for functional dexterous grasping, which directly encodes task-driven grasp configurations by localizing functional contact points. Our method introduces Contact-guided Multi-Keypoint Affordance (CMKA), leveraging human grasping experience images for weak supervision combined with Large Vision Models for fine affordance feature extraction, achieving generalization while avoiding manual keypoint annotations. Additionally, we present a Keypoint-based Grasp matrix Transformation (KGT) method, ensuring spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Motor Control and Adaptation · Human Pose and Action Recognition
