OmniVTON++: Training-Free Universal Virtual Try-On with Principal Pose Guidance

Zhaotong Yang; Yong Du; Shengfeng He; Yuhui Li; Xinzhe Li; Yangyang Xu; Junyu Dong; and Jian Yang

arXiv:2602.14552 · cs.CV · March 12, 2026

Open Access

TL;DR

OmniVTON++ is a versatile, training-free virtual try-on framework that uses principal pose guidance and boundary refinement to achieve state-of-the-art results across diverse scenarios without retraining.

Contribution

The paper introduces a universal, training-free VTON method that combines Structured Garment Morphing, Principal Pose Guidance, and Continuous Boundary Stitching for broad applicability.

Findings

1. Achieves state-of-the-art performance in diverse settings.
2. Operates reliably across different datasets and garment types.
3. Supports multi-garment, multi-human, and anime character try-on.

Abstract

Image-based Virtual Try-On (VTON) concerns the synthesis of realistic person imagery through garment re-rendering under human pose and body constraints. In practice, however, existing approaches are typically optimized for specific data conditions, making their deployment reliant on retraining and limiting their generalization as a unified solution. We present OmniVTON++, a training-free VTON framework designed for universal applicability. It addresses the intertwined challenges of garment alignment, human structural coherence, and boundary continuity by coordinating Structured Garment Morphing for correspondence-driven garment adaptation, Principal Pose Guidance for step-wise structural regulation during diffusion sampling, and Continuous Boundary Stitching for boundary-aware refinement, forming a cohesive pipeline without task-specific retraining. Experimental results demonstrate that…


Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Face Recognition and Analysis