Decomposed Vector-Quantized Variational Autoencoder for Human Grasp   Generation

Zhe Zhao; Mengshi Qi; Huadong Ma

arXiv:2407.14062·cs.CV·July 22, 2024

Decomposed Vector-Quantized Variational Autoencoder for Human Grasp Generation

Zhe Zhao, Mengshi Qi, Huadong Ma

PDF

Open Access 1 Repo

TL;DR

This paper introduces a decomposed vector-quantized VAE that improves the realism and accuracy of human grasp generation by separately encoding hand parts and employing a dual-stage decoding process, outperforming existing methods.

Contribution

The paper proposes a novel decomposed architecture and dual-stage decoding strategy for more precise and realistic human grasp generation, addressing limitations of previous holistic approaches.

Findings

01

Achieved 14.1% improvement in quality index over state-of-the-art methods.

02

Enhanced realism and adaptability in human grasp generation.

03

Effective management of hand-object interaction through part-aware encoding.

Abstract

Generating realistic human grasps is a crucial yet challenging task for applications involving object manipulation in computer graphics and robotics. Existing methods often struggle with generating fine-grained realistic human grasps that ensure all fingers effectively interact with objects, as they focus on encoding hand with the whole representation and then estimating both hand posture and position in a single step. In this paper, we propose a novel Decomposed Vector-Quantized Variational Autoencoder (DVQ-VAE) to address this limitation by decomposing hand into several distinct parts and encoding them separately. This part-aware decomposed architecture facilitates more precise management of the interaction between each component of hand and object, enhancing the overall reality of generated human grasps. Furthermore, we design a newly dual-stage decoding strategy, by first…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

florasion/d-vqvae
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Robotic Locomotion and Control · Robot Manipulation and Learning

MethodsFocus