Do Not Mask What You Do Not Need to Mask: a Parser-Free Virtual Try-On
Thibaut Issenhuth, J\'er\'emie Mary, Cl\'ement Calauz\`enes

TL;DR
This paper introduces a parser-free, real-time virtual try-on method using a student-teacher framework that avoids error-prone pre-processing steps, improving efficiency and accuracy in cloth fitting tasks.
Contribution
It proposes a novel parser-free approach with a student-teacher paradigm, eliminating the need for human parsers and pose estimators at inference time.
Findings
Enables real-time virtual try-on without human parsers.
Achieves comparable or better accuracy than parser-based methods.
Reduces pre-processing complexity and inference time.
Abstract
The 2D virtual try-on task has recently attracted a great interest from the research community, for its direct potential applications in online shopping as well as for its inherent and non-addressed scientific challenges. This task requires fitting an in-shop cloth image on the image of a person, which is highly challenging because it involves cloth warping, image compositing, and synthesizing. Casting virtual try-on into a supervised task faces a difficulty: available datasets are composed of pairs of pictures (cloth, person wearing the cloth). Thus, we have no access to ground-truth when the cloth on the person changes. State-of-the-art models solve this by masking the cloth information on the person with both a human parser and a pose estimator. Then, image synthesis modules are trained to reconstruct the person image from the masked person image and the cloth image. This procedure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
