MFP-VTON: Enhancing Mask-Free Person-to-Person Virtual Try-On via Diffusion Transformer
Le Shen, Yanting Kang, Rong Huang, Zhijie Wang

TL;DR
MFP-VTON introduces a mask-free person-to-person virtual try-on framework that leverages a diffusion transformer and a novel dataset, enabling high-fidelity fitting image generation without requiring standard garments.
Contribution
The paper presents a novel mask-free VTON framework using a diffusion transformer and a specialized dataset, improving ease of use and image quality in person-to-person virtual try-on.
Findings
Outperforms existing methods in fidelity and realism.
The Focus Attention loss emphasizes the garment of the reference person and the details outside the garment of the target person.
Achieves high-quality results in both person-to-person and garment-to-person VTON tasks.
Abstract
The garment-to-person virtual try-on (VTON) task, which aims to generate fitting images of a person wearing a reference garment, has made significant strides. However, obtaining a standard garment is often more challenging than using the garment already worn by the person. To improve ease of use, we propose MFP-VTON, a Mask-Free framework for Person-to-Person VTON. Recognizing the scarcity of person-to-person data, we adapt a garment-to-person model and dataset to construct a specialized dataset for this task. Our approach builds upon a pretrained diffusion transformer, leveraging its strong generative capabilities. During mask-free model fine-tuning, we introduce a Focus Attention loss to emphasize the garment of the reference person and the details outside the garment of the target person. Experimental results demonstrate that our model excels in both person-to-person and…
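The abstract names a Focus Attention loss but does not give its formula. As a rough illustration of the general idea only, the sketch below penalizes attention mass that falls outside a chosen "focus" region of the key tokens. Everything in it is an assumption made for illustration and is not taken from the paper: the function name `focus_attention_loss`, the negative-log form of the penalty, and the idea that the focus region is supplied as a binary mask over key tokens (for example, reference-garment tokens together with target tokens outside the garment).

```python
import numpy as np


def focus_attention_loss(attn, focus_mask, eps=1e-8):
    """Hypothetical attention-focusing penalty (not the paper's formula).

    attn:       (num_queries, num_keys) attention weights, each row sums to 1.
    focus_mask: (num_keys,) binary mask, 1 for keys the model should attend to.
    Returns the mean negative log of the attention mass placed on focus keys.
    """
    attn = np.asarray(attn, dtype=np.float64)
    mask = np.asarray(focus_mask, dtype=np.float64)
    # Attention mass each query places on the focus region.
    mass_on_focus = attn @ mask
    # Near zero when all mass is in the region, large when almost none is.
    return float(np.mean(-np.log(mass_on_focus + eps)))


# Usage: two queries attending uniformly over four keys,
# where only the last two keys belong to the focus region.
uniform_attn = np.full((2, 4), 0.25)
mask = np.array([0, 0, 1, 1])
loss = focus_attention_loss(uniform_attn, mask)
```

Under these assumptions the penalty shrinks as attention concentrates on the masked region, which is one plausible way to "emphasize" a region during fine-tuning; the authors' actual formulation may differ.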
Taxonomy
Topics: Visual Attention and Saliency Detection · Virtual Reality Applications and Impacts · Teleoperation and Haptic Systems
