Adaptive Multi-Modal Control of Digital Human Hand Synthesis Using a Region-Aware Cycle Loss
Qifan Fu, Xiaohang Yang, Muhammad Asad, Changjae Oh, Shanxin Yuan,, Gregory Slabaugh

TL;DR
This paper introduces a novel diffusion model training method with a region-aware cycle loss to enhance the accuracy and quality of hand pose synthesis in digital human images, leveraging multi-modal data and a new dataset.
Contribution
It proposes a region-aware cycle loss and multi-modal fusion approach, along with a new dataset, to significantly improve hand region synthesis in diffusion-based human image generation.
Findings
Enhanced hand pose quality in generated images
Improved overall pose accuracy with RACL
Effective multi-modal fusion for detailed hand synthesis
Abstract
Diffusion models have shown their remarkable ability to synthesize images, including the generation of humans in specific poses. However, current models face challenges in adequately expressing conditional control for detailed hand pose generation, leading to significant distortion in the hand regions. To tackle this problem, we first curate the How2Sign dataset to provide richer and more accurate hand pose annotations. In addition, we introduce adaptive, multi-modal fusion to integrate characters' physical features expressed in different modalities such as skeleton, depth, and surface normal. Furthermore, we propose a novel Region-Aware Cycle Loss (RACL) that enables the diffusion model training to focus on improving the hand region, resulting in improved quality of generated hand gestures. More specifically, the proposed RACL computes a weighted keypoint distance between the full-body…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Ergonomics and Musculoskeletal Disorders · Augmented Reality Applications
MethodsDiffusion · Focus
