HandDreamer: Zero-Shot Text to 3D Hand Model Generation using Corrective Hand Shape Guidance

Green Rosh; Prateek Kukreja; Vishakha SR; Pawan Prasad B H

arXiv:2604.04425·cs.CV·April 7, 2026

HandDreamer: Zero-Shot Text to 3D Hand Model Generation using Corrective Hand Shape Guidance

Green Rosh, Prateek Kukreja, Vishakha SR, Pawan Prasad B H

PDF

TL;DR

HandDreamer is a novel zero-shot text-to-3D hand model generation method that uses a hand skeleton guided diffusion process and corrective shape guidance to produce view-consistent, detailed, and customizable 3D hand models.

Contribution

It introduces the first zero-shot 3D hand model generation approach from text prompts, addressing view-inconsistencies with a skeleton-guided diffusion and corrective shape loss.

Findings

01

Outperforms state-of-the-art methods in 3D hand model quality.

02

Ensures view and pose consistency in generated hand models.

03

Effectively handles large variations in hand articulations and poses.

Abstract

The emergence of virtual reality has necessitated the generation of detailed and customizable 3D hand models for interaction in the virtual world. However, the current methods for 3D hand model generation are both expensive and cumbersome, offering very little customizability to the users. While recent advancements in zero-shot text-to-3D synthesis have enabled the generation of diverse and customizable 3D models using Score Distillation Sampling (SDS), they do not generalize very well to 3D hand model generation, resulting in unnatural hand structures, view-inconsistencies and loss of details. To address these limitations, we introduce HandDreamer, the first method for zero-shot 3D hand model generation from text prompts. Our findings suggest that view-inconsistencies in SDS is primarily caused due to the ambiguity in the probability landscape described by the text prompt, resulting in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.