MagicFace: Training-free Universal-Style Human Image Customized Synthesis
Yibin Wang, Weizhong Zhang, Cheng Jin

TL;DR
MagicFace is a training-free, multi-concept human image synthesis method that creates personalized images by simulating human concept creation, using a coarse-to-fine pipeline with novel attention mechanisms.
Contribution
It introduces a training-free approach with a two-stage pipeline and new attention modules for flexible, multi-concept human image personalization.
Findings
Outperforms existing methods in personalized human image synthesis.
Effectively handles multiple concepts without fine-tuning.
Demonstrates superior quality and flexibility in experiments.
Abstract
Current human image customization methods leverage Stable Diffusion (SD) for its rich semantic prior. However, since SD is not specifically designed for human-oriented generation, these methods often require extensive fine-tuning on large-scale datasets, which renders them susceptible to overfitting and hinders their ability to personalize individuals with previously unseen styles. Moreover, these methods extensively focus on single-concept human image synthesis and lack the flexibility to customize individuals using multiple given concepts, thereby impeding their broader practical application. This paper proposes MagicFace, a novel training-free method for multi-concept universal-style human image personalized synthesis. Our core idea is to simulate how humans create images given specific concepts, i.e., first establish a semantic layout considering factors such as concepts' shape and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition · Face recognition and analysis
MethodsSoftmax · Attention Is All You Need · Diffusion · Focus
