TL;DR
OMFA introduces a unified, mask-free diffusion framework enabling flexible, pose-agnostic virtual try-on and try-off from a single portrait, surpassing previous methods in practicality and generalization.
Contribution
The paper proposes OMFA, a novel bidirectional Tweedie diffusion model that supports arbitrary pose transfer and cross-person garment transfer without exhibition garments or segmentation masks.
Findings
Achieves state-of-the-art results on try-on and try-off tasks.
Supports multi-view and arbitrary-pose try-on from a single image.
Operates without exhibition garments, enabling practical garment transfer.
Abstract
Recent diffusion-based approaches have made significant advances in image-based virtual try-on, enabling more realistic and end-to-end garment synthesis. However, most existing methods remain constrained by their reliance on exhibition garments and segmentation masks, as well as their limited ability to handle flexible pose variations. These limitations reduce their practicality in real-world scenarios; for instance, users cannot easily transfer garments worn by one person onto another, and the generated try-on results are typically restricted to the same pose as the reference image. In this paper, we introduce OMFA (One Model For All), a unified diffusion framework for both virtual try-on and try-off that operates without the need for exhibition garments and supports arbitrary poses. OMFA is inspired by the mask-based paradigm of discrete diffusion language models and unifies try-on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
