MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
Xierui Wang, Siming Fu, Qihan Huang, Wanggui He, Hao Jiang

TL;DR
MS-Diffusion is a novel framework that enables zero-shot, layout-guided multi-subject image personalization in text-to-image generation, maintaining subject details and ensuring cohesive multi-subject compositions.
Contribution
The paper introduces MS-Diffusion, a new approach integrating grounding tokens and enhanced cross-attention for improved multi-subject image personalization with layout guidance.
Findings
Outperforms existing models in image fidelity.
Achieves better preservation of subject details.
Ensures cohesive multi-subject compositions.
Abstract
Recent advancements in text-to-image generation models have dramatically enhanced the generation of photorealistic images from textual prompts, leading to an increased interest in personalized text-to-image applications, particularly in multi-subject scenarios. However, these advances are hindered by two main challenges: firstly, the need to accurately maintain the details of each referenced subject in accordance with the textual descriptions; and secondly, the difficulty in achieving a cohesive representation of multiple subjects in a single image without introducing inconsistencies. To address these concerns, our research introduces the MS-Diffusion framework for layout-guided zero-shot image personalization with multi-subjects. This innovative approach integrates grounding tokens with the feature resampler to maintain detail fidelity among subjects. With the layout guidance,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Generative Adversarial Networks and Image Synthesis · Medical Image Segmentation Techniques
