Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On

Xu Yang; Changxing Ding; Zhibin Hong; Junhao Huang; Jin Tao; Xiangmin; Xu

arXiv:2404.01089·cs.CV·April 2, 2024·1 cites

Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On

Xu Yang, Changxing Ding, Zhibin Hong, Junhao Huang, Jin Tao, Xiangmin, Xu

PDF

Open Access 1 Repo 1 Models

TL;DR

This paper introduces a texture-preserving diffusion model for virtual try-on that improves fidelity without extra encoders by concatenating images and predicting inpainting masks, outperforming existing methods.

Contribution

The paper presents a novel diffusion-based virtual try-on approach that enhances texture transfer efficiency and accuracy without additional encoders, using image concatenation and mask prediction.

Findings

01

Outperforms state-of-the-art methods on VITON and VITON-HD datasets.

02

Enables efficient and accurate texture transfer with a single compact model.

03

Supports various try-on tasks like garment-to-person and person-to-person.

Abstract

Image-based virtual try-on is an increasingly important task for online shopping. It aims to synthesize images of a specific person wearing a specified garment. Diffusion model-based approaches have recently become popular, as they are excellent at image synthesis tasks. However, these approaches usually employ additional image encoders and rely on the cross-attention mechanism for texture transfer from the garment to the person image, which affects the try-on's efficiency and fidelity. To address these issues, we propose an Texture-Preserving Diffusion (TPD) model for virtual try-on, which enhances the fidelity of the results and introduces no additional image encoders. Accordingly, we make contributions from two aspects. First, we propose to concatenate the masked person and reference garment images along the spatial dimension and utilize the resulting image as the input for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gal4way/tpd
pytorchOfficial

Models

🤗
sahirp/TPD-VITON
model· ♡ 2
♡ 2

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis

MethodsDiffusion · Inpainting