ManiDext: Hand-Object Manipulation Synthesis via Continuous Correspondence Embeddings and Residual-Guided Diffusion
Jiajun Zhang, Yuxiang Zhang, Liang An, Mengcheng Li and, Hongwen Zhang, Zonghai Hu, Yebin Liu

TL;DR
ManiDext introduces a diffusion-based framework that models detailed hand-object contact correspondences to generate realistic, physically plausible manipulation motions, advancing the synthesis of complex hand-object interactions.
Contribution
It proposes a novel continuous correspondence embedding and residual-guided diffusion process for improved hand-object manipulation synthesis.
Findings
Generates realistic hand-object manipulation motions.
Effectively models contact correspondences at vertex level.
Outperforms existing methods in physical plausibility and realism.
Abstract
Dynamic and dexterous manipulation of objects presents a complex challenge, requiring the synchronization of hand motions with the trajectories of objects to achieve seamless and physically plausible interactions. In this work, we introduce ManiDext, a unified hierarchical diffusion-based framework for generating hand manipulation and grasp poses based on 3D object trajectories. Our key insight is that accurately modeling the contact correspondences between objects and hands during interactions is crucial. Therefore, we propose a continuous correspondence embedding representation that specifies detailed hand correspondences at the vertex level between the object and the hand. This embedding is optimized directly on the hand mesh in a self-supervised manner, with the distance between embeddings reflecting the geodesic distance. Our framework first generates contact maps and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Robot Manipulation and Learning · Human Pose and Action Recognition
MethodsDiffusion
