Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Yisol Choi, Sangkyung Kwak, Kyungmin Lee, Hyungwon Choi, Jinwoo Shin

TL;DR
This paper introduces IDM-VTON, a novel diffusion model for virtual try-on that enhances garment fidelity and authenticity by integrating semantic encoding modules and textual prompts, outperforming previous methods.
Contribution
The paper proposes IDM-VTON, a diffusion-based virtual try-on model with dual semantic encoding modules and textual prompts, improving garment detail preservation and authenticity over prior approaches.
Findings
Outperforms previous diffusion and GAN-based methods in garment detail preservation.
Enhances authenticity of virtual try-on images through semantic encoding and textual prompts.
Effective in real-world scenarios with improved fidelity and authenticity.
Abstract
This paper considers image-based virtual try-on, which renders an image of a person wearing a curated garment, given a pair of images depicting the person and the garment, respectively. Previous works adapt existing exemplar-based inpainting diffusion models for virtual try-on to improve the naturalness of the generated visuals compared to other methods (e.g., GAN-based), but they fail to preserve the identity of the garments. To overcome this limitation, we propose a novel diffusion model that improves garment fidelity and generates authentic virtual try-on images. Our method, coined IDM-VTON, uses two different modules to encode the semantics of garment image; given the base UNet of the diffusion model, 1) the high-level semantics extracted from a visual encoder are fused to the cross-attention layer, and then 2) the low-level features extracted from parallel UNet are fused to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗yisol/IDM-VTONmodel· 10k dl· ♡ 70510k dl♡ 705
- 🤗ananta11K/IDM-VTONmodel· 17 dl· ♡ 117 dl♡ 1
- 🤗yisol/IDM-VTON-DCmodel· ♡ 15♡ 15
- 🤗imaginairy/idm-vton-safetensorsmodel· 9 dl· ♡ 29 dl♡ 2
- 🤗Vijish/VTONmodel
- 🤗Roopansh/Ailusion-VTON-DEMO-v1.1model· 4 dl· ♡ 14 dl♡ 1
- 🤗MnLgt/IDM-VTONmodel· 2 dl2 dl
- 🤗ModelsLab/IDM-VTONmodel· 14 dl· ♡ 414 dl♡ 4
- 🤗rafiaashraf/tryOn_modelmodel· 23 dl· ♡ 823 dl♡ 8
- 🤗vanhieunguyen21/IDM-VTONmodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis
MethodsDiffusion · Inpainting · Balanced Selection
