FSAG: Enhancing Human-to-Dexterous-Hand Finger-Specific Affordance Grounding via Diffusion Models
Yifan Han, Yichuan Peng, Pengfei Yi, Junyan Li, Hanqing Wang, Gaojing Zhang, Qi Peng Liu, Wenzhao Lian

TL;DR
This paper presents a data-efficient, semantic affordance-based framework for dexterous grasp synthesis that leverages pretrained diffusion models and human demonstrations, enabling stable, generalizable grasps without extensive dataset collection.
Contribution
It introduces a novel semantic affordance extraction pipeline using vision-language generative priors and demonstrates cross-hand generalization without hardware-specific grasp datasets.
Findings
Produces stable, human-like multi-contact grasps across various objects.
Generalizes well to unseen objects, poses, and different hand embodiments.
Achieves high-quality grasp synthesis using only depth images and pretrained models.
Abstract
Dexterous grasp synthesis must jointly satisfy functional intent and physical feasibility, yet existing pipelines often decouple semantic grounding from refinement, yielding unstable or non-functional contacts under object and pose variations. This challenge is exacerbated by the high dimensionality and kinematic diversity of multi-fingered hands, which makes many methods rely on large, hardware-specific grasp datasets collected in simulation or through costly real-world trials. We propose a data-efficient framework that bypasses robot grasp data collection by exploiting object-centric semantic priors in pretrained generative diffusion models. Temporally aligned and fine-grained grasp affordances are extracted from raw human video demonstrations and fused with 3D scene geometry from depth images to infer semantically grounded contact targets. We further incorporate these affordance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Human Motion and Animation · Motor Control and Adaptation
