Decomposed Vector-Quantized Variational Autoencoder for Human Grasp Generation
Zhe Zhao, Mengshi Qi, Huadong Ma

TL;DR
This paper introduces a decomposed vector-quantized VAE that improves the realism and accuracy of human grasp generation by separately encoding hand parts and employing a dual-stage decoding process, outperforming existing methods.
Contribution
The paper proposes a novel decomposed architecture and dual-stage decoding strategy for more precise and realistic human grasp generation, addressing limitations of previous holistic approaches.
Findings
Achieved 14.1% improvement in quality index over state-of-the-art methods.
Enhanced realism and adaptability in human grasp generation.
Effective management of hand-object interaction through part-aware encoding.
Abstract
Generating realistic human grasps is a crucial yet challenging task for applications involving object manipulation in computer graphics and robotics. Existing methods often struggle with generating fine-grained realistic human grasps that ensure all fingers effectively interact with objects, as they focus on encoding hand with the whole representation and then estimating both hand posture and position in a single step. In this paper, we propose a novel Decomposed Vector-Quantized Variational Autoencoder (DVQ-VAE) to address this limitation by decomposing hand into several distinct parts and encoding them separately. This part-aware decomposed architecture facilitates more precise management of the interaction between each component of hand and object, enhancing the overall reality of generated human grasps. Furthermore, we design a newly dual-stage decoding strategy, by first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Robotic Locomotion and Control · Robot Manipulation and Learning
MethodsFocus
