ODPG: Outfitting Diffusion with Pose Guided Condition
Seohyun Lee, Jintae Park, Sanghyeok Park

TL;DR
ODPG introduces a pose-guided diffusion model for virtual try-on that produces realistic, detailed images of garments on humans in various poses without explicit garment warping.
Contribution
It presents a novel latent diffusion approach with multiple conditioning inputs for improved virtual try-on realism and pose handling.
Findings
Generates realistic VTON images with fine details across poses
Operates end-to-end without explicit garment warping
Demonstrates effectiveness on FashionTryOn and DeepFashion datasets
Abstract
Virtual Try-On (VTON) technology allows users to visualize how clothes would look on them without physically trying them on, gaining traction with the rise of digitalization and online shopping. Traditional VTON methods, often using Generative Adversarial Networks (GANs) and Diffusion models, face challenges in achieving high realism and handling dynamic poses. This paper introduces Outfitting Diffusion with Pose Guided Condition (ODPG), a novel approach that leverages a latent diffusion model with multiple conditioning inputs during the denoising process. By transforming garment, pose, and appearance images into latent features and integrating these features in a UNet-based denoising model, ODPG achieves non-explicit synthesis of garments on dynamically posed human images. Our experiments on the FashionTryOn and a subset of the DeepFashion dataset demonstrate that ODPG generates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax · Attention Is All You Need · Diffusion · Latent Diffusion Model · Focus
