Sketch2Human: Deep Human Generation with Disentangled Geometry and Appearance Control
Linzi Qu, Jiaxiang Shang, Hui Ye, Xiaoguang Han, and Hongbo Fu

TL;DR
Sketch2Human is a novel system enabling controllable full-body human image generation using semantic sketches for geometry and reference images for appearance, achieving high fidelity and diversity.
Contribution
It introduces a disentangled control framework for geometry and appearance in full-body human generation guided by sketches and images, based on StyleGAN-Human.
Findings
Outperforms state-of-the-art methods in qualitative and quantitative evaluations.
Handles hand-drawn sketches effectively.
Achieves high-fidelity, diverse, and controllable human image synthesis.
Abstract
Geometry- and appearance-controlled full-body human image generation is an interesting but challenging task. Existing solutions are either unconditional or dependent on coarse conditions (e.g., pose, text), thus lacking explicit geometry and appearance control of body and garment. Sketching offers such editing ability and has been adopted in various sketch-based face generation and editing solutions. However, directly adapting sketch-based face generation to full-body generation often fails to produce high-fidelity and diverse results due to the high complexity and diversity in the pose, body shape, and garment shape and texture. Recent geometrically controllable diffusion-based methods mainly rely on prompts to generate appearance and it is hard to balance the realism and the faithfulness of their results to the sketch when the input is coarse. This work presents Sketch2Human, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Human Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis
