OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person
Ke Sun, Jian Cao, Qi Wang, Linrui Tian, Xindi Zhang, Lian Zhuo, Bang, Zhang, Liefeng Bo, Wenbo Zhou, Weiming Zhang, Daiheng Gao

TL;DR
OutfitAnyone introduces a two-stream diffusion model that significantly enhances virtual try-on quality, enabling realistic, detailed, and scalable clothing simulations across diverse scenarios and user body types.
Contribution
The paper presents a novel two-stream conditional diffusion approach that improves garment deformation handling and scalability in virtual try-on applications.
Findings
Achieves high-fidelity, detail-consistent virtual try-on results
Handles diverse poses, body shapes, and clothing styles effectively
Demonstrates scalability from anime to real-world images
Abstract
Virtual Try-On (VTON) has become a transformative technology, empowering users to experiment with fashion without ever having to physically try on clothing. However, existing methods often struggle with generating high-fidelity and detail-consistent results. While diffusion models, such as Stable Diffusion series, have shown their capability in creating high-quality and photorealistic images, they encounter formidable challenges in conditional generation scenarios like VTON. Specifically, these models struggle to maintain a balance between control and consistency when generating images for virtual clothing trials. OutfitAnyone addresses these limitations by leveraging a two-stream conditional diffusion model, enabling it to adeptly handle garment deformation for more lifelike results. It distinguishes itself with scalability-modulating factors such as pose, body shape and broad…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
