TL;DR
CatVTON is a lightweight, efficient diffusion-based virtual try-on model that simplifies the process by removing redundant modules and preprocessing, achieving high-quality results with fewer parameters and less memory usage.
Contribution
This work introduces CatVTON, a novel virtual try-on diffusion model that reduces complexity and training requirements by concatenating inputs and removing unnecessary components.
Findings
Achieves superior qualitative and quantitative results.
Requires only 49.57M training parameters.
Reduces over 49% memory usage during inference.
Abstract
Virtual try-on methods based on diffusion models achieve realistic effects but often require additional encoding modules, a large number of training parameters, and complex preprocessing, which increases the burden on training and inference. In this work, we re-evaluate the necessity of additional modules and analyze how to improve training efficiency and reduce redundant steps in the inference process. Based on these insights, we propose CatVTON, a simple and efficient virtual try-on diffusion model that transfers in-shop or worn garments of arbitrary categories to target individuals by concatenating them along spatial dimensions as inputs of the diffusion model. The efficiency of CatVTON is reflected in three aspects: (1) Lightweight network. CatVTON consists only of a VAE and a simplified denoising UNet, removing redundant image and text encoders as well as cross-attentions, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗zhengchong/CatVTONmodel· 1.9k dl· ♡ 721.9k dl♡ 72
- 🤗abubakar123456/CatVTONmodel
- 🤗91stefan/CatVTONmodel· ♡ 2♡ 2
- 🤗zhengchong/CatVTON-MaskFreemodel· 35 dl· ♡ 1335 dl♡ 13
- 🤗camenduru/CatVTONmodel· ♡ 1♡ 1
- 🤗annh/cvtmodel
- 🤗xiaozaa/catvton-flux-alphamodel· 289 dl· ♡ 47289 dl♡ 47
- 🤗xiaozaa/cat-tryoff-fluxmodel· 644 dl· ♡ 18644 dl♡ 18
- 🤗xiaozaa/catvton-flux-betamodel· 461 dl· ♡ 16461 dl♡ 16
- 🤗kedarphadnis10/CatVTONmodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
