DirectTryOn: One-Step Virtual Try-On via Straightened Conditional Transport

Xianbing Sun; Jiahui Zhan; Liqing Zhang; Jianfu Zhang

arXiv:2605.12939 · cs.CV · May 14, 2026

TL;DR

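DirectTryOn is a one-step virtual try-on (VTON) method. It builds on the observation that try-on outputs are tightly constrained by the conditional inputs, so the conditional sampling trajectory can be much straighter than in general image generation. This makes one-step generation a natural fit and avoids the inference cost of multi-step sampling.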

Contribution

The paper proposes a novel one-step VTON approach using straightened conditional transport and introduces specific losses and distillation to align pretrained models with task constraints.

Findings

01 Achieves state-of-the-art results with one-step sampling.

02 Reduces inference cost compared to multi-step diffusion and flow-based methods.

03 Demonstrates high-quality virtual try-on performance.

Abstract

Recent diffusion- and flow-based VTON methods achieve strong results with pretrained generative models, but their reliance on multi-step sampling incurs high inference cost, while existing acceleration methods largely overlook the intrinsic structure of the try-on task. In this paper, we highlight a key observation: VTON outputs are highly constrained by the conditional inputs, suggesting that the conditional sampling trajectory can be much straighter than that in general image generation, making one-step generation a natural solution. However, limited task-specific data makes training from scratch impractical, forcing existing methods to fine-tune pretrained models whose objectives do not encourage such straight conditional trajectories. Thus, the deviation from an ideal straight path mainly comes from the mismatch between pretrained base models and the conditional nature of try-on…
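
The link between a straight sampling trajectory and one-step generation can be illustrated with a toy example. The sketch below is not the paper's method or code: it is a minimal 2D illustration in plain Python, and every name in it (`euler_sample`, `straight_velocity`, `curved_velocity`, `max_gap`) is hypothetical. It assumes a flow-style sampler that integrates dx/dt = v(x, t, c) from t = 0 to t = 1 with Euler steps. When the path is a straight line whose endpoint is fixed by the condition c, a single Euler step lands on the same endpoint as many steps; when the velocity changes along the path, a single step misses it.

```python
def euler_sample(velocity, x0, cond, num_steps):
    """Integrate dx/dt = velocity(x, t, cond) from t=0 to t=1 with Euler steps."""
    x = list(x0)
    dt = 1.0 / num_steps
    for i in range(num_steps):
        t = i * dt  # t stays strictly below 1, so (1 - t) is never zero
        v = velocity(x, t, cond)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x


def straight_velocity(x, t, cond):
    # Toy straight path x_t = (1 - t) * x0 + t * cond.
    # Along this path the velocity (cond - x_t) / (1 - t) equals cond - x0,
    # which is constant, so Euler integration is exact for any step count.
    return [(ci - xi) / (1.0 - t) for xi, ci in zip(x, cond)]


def curved_velocity(x, t, cond):
    # Toy curved path: the second component speeds up over time,
    # so the velocity at t=0 does not point at the endpoint.
    return [cond[0], 2.0 * t * cond[1]]


def max_gap(a, b):
    """Largest per-coordinate difference between two points."""
    return max(abs(ai - bi) for ai, bi in zip(a, b))


x0 = [0.0, 0.0]    # toy stand-in for the starting sample
cond = [1.0, 1.0]  # toy stand-in for the conditioning signal

# Gap between a 1-step and a 50-step sample for each velocity field.
straight_gap = max_gap(
    euler_sample(straight_velocity, x0, cond, 1),
    euler_sample(straight_velocity, x0, cond, 50),
)
curved_gap = max_gap(
    euler_sample(curved_velocity, x0, cond, 1),
    euler_sample(curved_velocity, x0, cond, 50),
)

print("straight path, 1-step vs 50-step gap:", straight_gap)
print("curved path,   1-step vs 50-step gap:", curved_gap)
```

In this toy setting the straight-path gap is at the level of floating-point error, while the curved-path gap is large. This only illustrates why a straighter conditional trajectory makes one-step sampling plausible; it says nothing about how the paper's losses or distillation achieve that straightening.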
