FastGrasp: Efficient Grasp Synthesis with Diffusion
Xiaofei Wu, Tao Liu, Caoji Li, Yuexin Ma, Yujiao Shi, Xuming He

TL;DR
FastGrasp introduces a diffusion-model-based method for rapid, diverse, and physically plausible human hand grasp synthesis, significantly outperforming prior two-stage approaches in speed and quality.
Contribution
The paper presents a novel one-stage diffusion model for hand-object grasp synthesis, improving efficiency and diversity over traditional two-stage methods.
Findings
Faster inference compared to state-of-the-art methods
Higher diversity in generated hand poses
Superior pose quality demonstrated through extensive experiments
Abstract
Effectively modeling the interaction between human hands and objects is challenging due to the complex physical constraints and the requirement for high generation efficiency in applications. Prior approaches often employ computationally intensive two-stage approaches, which first generate an intermediate representation, such as contact maps, followed by an iterative optimization procedure that updates hand meshes to capture the hand-object relation. However, due to the high computation complexity during the optimization stage, such strategies often suffer from low efficiency in inference. To address this limitation, this work introduces a novel diffusion-model-based approach that generates the grasping pose in a one-stage manner. This allows us to significantly improve generation speed and the diversity of generated hand poses. In particular, we develop a Latent Diffusion Model with an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Software Testing and Debugging Techniques · Modular Robots and Swarm Intelligence
MethodsLatent Diffusion Model · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Diffusion
