MegaPortrait: Revisiting Diffusion Control for High-fidelity Portrait Generation
Han Yang, Sotiris Anagnostidis, Enis Simsar, Thomas Hofmann

TL;DR
MegaPortrait is a novel system that combines multiple neural modules to generate high-fidelity, personalized portraits with superior identity preservation and image quality compared to existing AI portrait methods.
Contribution
It introduces a modular framework integrating Identity, Shading, and Harmonization Nets for improved portrait generation.
Findings
Outperforms state-of-the-art AI portrait products in identity preservation.
Achieves higher image fidelity and coherence in generated portraits.
Demonstrates effectiveness of combining off-the-shelf Controlnets with specialized modules.
Abstract
We propose MegaPortrait. It's an innovative system for creating personalized portrait images in computer vision. It has three modules: Identity Net, Shading Net, and Harmonization Net. Identity Net generates learned identity using a customized model fine-tuned with source images. Shading Net re-renders portraits using extracted representations. Harmonization Net fuses pasted faces and the reference image's body for coherent results. Our approach with off-the-shelf Controlnets is better than state-of-the-art AI portrait products in identity preservation and image fidelity. MegaPortrait has a simple but effective design and we compare it with other methods and products to show its superiority.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Computer Graphics and Visualization Techniques
