DPDEdit: Detail-Preserved Diffusion Models for Multimodal Fashion Image Editing
Xiaolong Wang, Zhi-Qi Cheng, Jue Wang, Xiaojiang Peng

TL;DR
DPDEdit is a novel multimodal fashion image editing framework that accurately locates editing regions and preserves garment textures using a combination of grounded region prediction, texture injection, and refinement mechanisms.
Contribution
This work introduces DPDEdit, a new architecture that integrates multiple modalities and a texture refinement process for improved fashion image editing.
Findings
Outperforms existing methods in image fidelity
Achieves better coherence with multimodal inputs
Effectively preserves garment textures during editing
Abstract
Fashion image editing is a crucial tool for designers to convey their creative ideas by visualizing design concepts interactively. Current fashion image editing techniques, though advanced with multimodal prompts and powerful diffusion models, often struggle to accurately identify editing regions and preserve the desired garment texture detail. To address these challenges, we introduce a new multimodal fashion image editing architecture based on latent diffusion models, called Detail-Preserved Diffusion Models (DPDEdit). DPDEdit guides the fashion image generation of diffusion models by integrating text prompts, region masks, human pose images, and garment texture images. To precisely locate the editing region, we first introduce Grounded-SAM to predict the editing region based on the user's textual description, and then combine it with other conditions to perform local editing. To…
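As a rough illustration of what local editing inside a predicted region involves, the sketch below takes pixels from an edited image only where a binary region mask is set and keeps the original everywhere else. This is not DPDEdit's implementation, which operates in the latent space of a diffusion model. The function name `composite_local_edit`, the list-of-lists image representation, and the idea that the mask would come from a region predictor such as Grounded-SAM are assumptions made for illustration.

```python
from typing import List

# Hypothetical representation: a grayscale image as rows of pixel values.
Image = List[List[float]]
Mask = List[List[int]]  # 1 = inside the editing region, 0 = keep original


def composite_local_edit(original: Image, edited: Image, mask: Mask) -> Image:
    """Return an image that uses `edited` where mask is 1 and `original` elsewhere."""
    if not (len(original) == len(edited) == len(mask)):
        raise ValueError("original, edited, and mask must have the same height")

    result: Image = []
    for orig_row, edit_row, mask_row in zip(original, edited, mask):
        if not (len(orig_row) == len(edit_row) == len(mask_row)):
            raise ValueError("original, edited, and mask must have the same width")
        # Per pixel: take the edited value only inside the masked region.
        result.append([e if m else o for o, e, m in zip(orig_row, edit_row, mask_row)])
    return result


if __name__ == "__main__":
    original = [[0.0, 0.0], [0.0, 0.0]]
    edited = [[1.0, 1.0], [1.0, 1.0]]
    mask = [[1, 0], [0, 1]]  # assumed to be produced by a region predictor
    print(composite_local_edit(original, edited, mask))
```

Pixels outside the mask are copied through unchanged, which is the property that keeps an edit local to the described garment region.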
Taxonomy
Topics: Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis
Methods: Max Pooling · Convolution · Diffusion · Concatenated Skip Connection · U-Net
