MagicView: Multi-View Consistent Identity Customization via Priors-Guided In-Context Learning
Hengjia Li, Jianjin Xu, Keli Cheng, Lei Wang, Ning Bi, Boxi Wu, Fernando De la Torre, Deng Cai

TL;DR
MagicView introduces a lightweight framework that enhances generative models with multi-view identity consistency using 3D priors and in-context learning, requiring minimal data and improving multi-view generation quality.
Contribution
The paper proposes a novel 3D priors-guided in-context learning approach for multi-view consistent identity customization in generative models, with a new Semantic Correspondence Alignment loss.
Findings
Outperforms recent baselines in multi-view consistency and visual quality
Achieves strong results with only 100 training samples
Effectively preserves semantic alignment across views
Abstract
Recent advances in personalized generative models have demonstrated impressive capabilities in producing identity-consistent images of the same individual across diverse scenes. However, most existing methods lack explicit viewpoint control and fail to ensure multi-view consistency of generated identities. To address this limitation, we present MagicView, a lightweight adaptation framework that equips existing generative models with multi-view generation capability through 3D priors-guided in-context learning. While prior studies have shown that in-context learning preserves identity consistency across grid samples, its effectiveness in multi-view settings remains unexplored. Building upon this insight, we conduct an in-depth analysis of the multi-view in-context learning ability, and design a conditioning architecture that leverages 3D priors to activate this capability for multi-view…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Multimodal Machine Learning Applications
