Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On
Xu Yang, Changxing Ding, Zhibin Hong, Junhao Huang, Jin Tao, Xiangmin, Xu

TL;DR
This paper introduces a texture-preserving diffusion model for virtual try-on that improves fidelity without extra encoders by concatenating images and predicting inpainting masks, outperforming existing methods.
Contribution
The paper presents a novel diffusion-based virtual try-on approach that enhances texture transfer efficiency and accuracy without additional encoders, using image concatenation and mask prediction.
Findings
Outperforms state-of-the-art methods on VITON and VITON-HD datasets.
Enables efficient and accurate texture transfer with a single compact model.
Supports various try-on tasks like garment-to-person and person-to-person.
Abstract
Image-based virtual try-on is an increasingly important task for online shopping. It aims to synthesize images of a specific person wearing a specified garment. Diffusion model-based approaches have recently become popular, as they are excellent at image synthesis tasks. However, these approaches usually employ additional image encoders and rely on the cross-attention mechanism for texture transfer from the garment to the person image, which affects the try-on's efficiency and fidelity. To address these issues, we propose an Texture-Preserving Diffusion (TPD) model for virtual try-on, which enhances the fidelity of the results and introduces no additional image encoders. Accordingly, we make contributions from two aspects. First, we propose to concatenate the masked person and reference garment images along the spatial dimension and utilize the resulting image as the input for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis
MethodsDiffusion · Inpainting
