HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting
Wenquan Lu, Yufei Xu, Jing Zhang, Chaoyue Wang, Dacheng Tao

TL;DR
HandRefiner is a lightweight post-processing method that uses conditional inpainting and hand mesh reconstruction to correct malformed hands in images generated by diffusion models, improving realism and accuracy.
Contribution
The paper introduces HandRefiner, a novel inpainting-based approach that leverages hand mesh models and ControlNet to refine malformed hands in generated images.
Findings
Significant improvement in hand realism and correctness in generated images.
Discovery of a phase transition in ControlNet control strength.
Effective use of synthetic data to enhance hand generation quality.
Abstract
Diffusion models have achieved remarkable success in generating realistic images but struggle to generate accurate human hands, often producing incorrect finger counts or irregular shapes. This difficulty arises from the complex task of learning the physical structure and pose of hands from training images, which involves extensive deformations and occlusions. To generate correct hands, our paper introduces a lightweight post-processing solution called HandRefiner. HandRefiner employs a conditional inpainting approach to rectify malformed hands while leaving other parts of the image untouched. We leverage a hand mesh reconstruction model that consistently adheres to the correct number of fingers and hand shape, while also being capable of fitting the desired hand pose in the generated image. Given a generated image that fails due to malformed hands, we utilize ControlNet modules to…
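The abstract's claim that inpainting rectifies the hand "while leaving other parts of the image untouched" can be illustrated with a minimal masked-compositing sketch. This is not the authors' implementation: the function name `composite_inpainted_region`, the pixel-space blend, and the toy arrays are all assumptions made for illustration. Diffusion inpainting pipelines often apply the mask during denoising rather than once at the end, so the sketch only shows the invariant, namely that pixels outside the hand mask are copied from the original image.

```python
import numpy as np


def composite_inpainted_region(original, inpainted, hand_mask):
    """Blend an inpainted result back into the original image.

    Hypothetical helper, not taken from the HandRefiner code base.
    original, inpainted: float arrays of shape (H, W, 3).
    hand_mask: array of shape (H, W); 1 inside the hand region, 0 elsewhere.
    """
    if original.shape != inpainted.shape:
        raise ValueError("original and inpainted must have the same shape")
    if hand_mask.shape != original.shape[:2]:
        raise ValueError("hand_mask must match the image height and width")

    # Add a channel axis so the mask broadcasts over the RGB channels.
    mask = hand_mask.astype(original.dtype)[..., None]

    # Inside the mask take the inpainted pixels; outside keep the original.
    return mask * inpainted + (1.0 - mask) * original


# Toy example: a black "original", a white "inpainted" result,
# and a 2x2 hand region in the middle of a 4x4 image.
original = np.zeros((4, 4, 3))
inpainted = np.ones((4, 4, 3))
hand_mask = np.zeros((4, 4))
hand_mask[1:3, 1:3] = 1

result = composite_inpainted_region(original, inpainted, hand_mask)
```

In this toy case only the central 2x2 block of `result` takes values from `inpainted`; every other pixel is identical to `original`.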
Taxonomy
Topics: Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition · Face recognition and analysis
Methods: Inpainting
