High-Fidelity Virtual Try-on with Large-Scale Unpaired Learning
Han Yang, Yanlong Zang, Ziwei Liu

TL;DR
This paper introduces BVTON, a novel virtual try-on framework that leverages large-scale unpaired fashion images and pseudo pair construction to achieve high-fidelity clothing transfer in virtual try-on applications.
Contribution
The paper proposes a new unpaired learning framework with a compositional canonicalizing flow and layered mask generation, enabling high-fidelity virtual try-on without paired training data.
Findings
Outperforms state-of-the-art methods in high-resolution datasets
Demonstrates strong generalizability across various styles and data sources
Achieves detailed and realistic clothing transfer results
Abstract
Virtual try-on (VTON) transfers a target clothing image to a reference person, where clothing fidelity is a key requirement for downstream e-commerce applications. However, existing VTON methods still fall short in high-fidelity try-on due to the conflict between the high diversity of dressing styles (\eg clothes occluded by pants or distorted by posture) and the limited paired data for training. In this work, we propose a novel framework \textbf{Boosted Virtual Try-on (BVTON)} to leverage the large-scale unpaired learning for high-fidelity try-on. Our key insight is that pseudo try-on pairs can be reliably constructed from vastly available fashion images. Specifically, \textbf{1)} we first propose a compositional canonicalizing flow that maps on-model clothes into pseudo in-shop clothes, dubbed canonical proxy. Each clothing part (sleeves, torso) is reversely deformed into an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Remote Desktop Technologies
