Multi-Keypoint Affordance Representation for Functional Dexterous Grasping

Fan Yang; Dongsheng Luo; Wenrui Chen; Jiacheng Lin; Junjie Cai; Kailun Yang; Zhiyong Li; Yaonan Wang

arXiv:2502.20018·cs.RO·August 13, 2025

Multi-Keypoint Affordance Representation for Functional Dexterous Grasping

Fan Yang, Dongsheng Luo, Wenrui Chen, Jiacheng Lin, Junjie Cai, Kailun Yang, Zhiyong Li, Yaonan Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel multi-keypoint affordance representation for functional dexterous grasping, directly encoding task-specific grasp configurations to improve manipulation accuracy and generalization in robotic systems.

Contribution

It proposes Contact-guided Multi-Keypoint Affordance (CMKA) and Keypoint-based Grasp matrix Transformation (KGT), enabling precise, generalizable, and direct visual-to-manipulation mapping without manual keypoint annotations.

Findings

01

Enhanced affordance localization accuracy

02

Improved grasp consistency and robustness

03

Better generalization to unseen tools and tasks

Abstract

Functional dexterous grasping requires precise hand-object interaction, going beyond simple gripping. Existing affordance-based methods primarily predict coarse interaction regions and cannot directly constrain the grasping posture, leading to a disconnection between visual perception and manipulation. To address this issue, we propose a multi-keypoint affordance representation for functional dexterous grasping, which directly encodes task-driven grasp configurations by localizing functional contact points. Our method introduces Contact-guided Multi-Keypoint Affordance (CMKA), leveraging human grasping experience images for weak supervision combined with Large Vision Models for fine affordance feature extraction, achieving generalization while avoiding manual keypoint annotations. Additionally, we present a Keypoint-based Grasp matrix Transformation (KGT) method, ensuring spatial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

popeyepxx/mka
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Motor Control and Adaptation · Human Pose and Action Recognition