Enhancing Virtual Try-On with Synthetic Pairs and Error-Aware Noise Scheduling
Nannan Li, Kevin J. Shih, Bryan A. Plummer

TL;DR
This paper improves virtual try-on by generating synthetic training pairs and applying an error-aware refinement process, leading to more realistic and accurate try-on images with higher user preference.
Contribution
It introduces a garment extraction model for synthetic data generation and a novel error-aware refinement method using Schr"odinger Bridge for enhanced image quality.
Findings
Synthetic data augmentation improves try-on performance.
EARSB refinement enhances image realism.
Users prefer our method in 59% of cases.
Abstract
Given an isolated garment image in a canonical product view and a separate image of a person, the virtual try-on task aims to generate a new image of the person wearing the target garment. Prior virtual try-on works face two major challenges in achieving this goal: a) the paired (human, garment) training data has limited availability; b) generating textures on the human that perfectly match that of the prompted garment is difficult, often resulting in distorted text and faded textures. Our work explores ways to tackle these issues through both synthetic data as well as model refinement. We introduce a garment extraction model that generates (human, synthetic garment) pairs from a single image of a clothed individual. The synthetic pairs can then be used to augment the training of virtual try-on. We also propose an Error-Aware Refinement-based Schr\"odinger Bridge (EARSB) that surgically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmbedded Systems Design Techniques
MethodsBalanced Selection
