Towards Squeezing-Averse Virtual Try-On via Sequential Deformation
Sang-Heon Shim, Jiwoo Chung, Jae-Pil Heo

TL;DR
This paper introduces SD-VITON, a sequential deformation approach that disentangles and refines appearance flow predictions to eliminate squeezing artifacts in high-resolution virtual try-on images.
Contribution
The paper proposes a novel sequential deformation framework that separates and refines appearance flow layers to improve visual quality in virtual try-on systems.
Findings
Reduces squeezing artifacts in virtual try-on images.
Outperforms baseline methods in visual quality.
Effectively disentangles deformation layers for better results.
Abstract
In this paper, we first investigate a visual quality degradation problem observed in recent high-resolution virtual try-on approach. The tendency is empirically found that the textures of clothes are squeezed at the sleeve, as visualized in the upper row of Fig.1(a). A main reason for the issue arises from a gradient conflict between two popular losses, the Total Variation (TV) and adversarial losses. Specifically, the TV loss aims to disconnect boundaries between the sleeve and torso in a warped clothing mask, whereas the adversarial loss aims to combine between them. Such contrary objectives feedback the misaligned gradients to a cascaded appearance flow estimation, resulting in undesirable squeezing artifacts. To reduce this, we propose a Sequential Deformation (SD-VITON) that disentangles the appearance flow prediction layers into TV objective-dominant (TVOB) layers and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Advanced Vision and Imaging
