Identity-Preserving Text-to-Image Generation via Dual-Level Feature Decoupling and Expert-Guided Fusion
Kewen Chen, Xiaobin Hu, Wenqi Ren

TL;DR
This paper introduces a novel framework for text-to-image generation that effectively preserves subject identity by decoupling identity-related features and fusing them with identity-irrelevant features, leading to higher quality and more diverse images.
Contribution
The work proposes a dual-level feature decoupling and expert-guided fusion framework, including IEDM and FFM modules, with new loss functions to better preserve identity in generated images.
Findings
Improved image quality and identity preservation.
Enhanced flexibility in scene adaptation.
Increased diversity of generated outputs.
Abstract
Recent advances in large-scale text-to-image generation models have led to a surge in subject-driven text-to-image generation, which aims to produce customized images that align with textual descriptions while preserving the identity of specific subjects. Despite significant progress, current methods struggle to disentangle identity-relevant information from identity-irrelevant details in the input images, resulting in overfitting or failure to maintain subject identity. In this work, we propose a novel framework that improves the separation of identity-related and identity-unrelated features and introduces an innovative feature fusion mechanism to improve the quality and text alignment of generated images. Our framework consists of two key components: an Implicit-Explicit foreground-background Decoupling Module (IEDM) and a Feature Fusion Module (FFM) based on a Mixture of Experts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Image Enhancement Techniques
