PEMF-VTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm
Tianyu Chang, Xiaohao Chen, Zhichao Wei, Xuanpu Zhang, Qing-Guo Chen, Weihua Luo, Peipei Song, Xun Yang

TL;DR
PEMF-VTO introduces a mask-free video virtual try-on method that uses sparse point alignments and a novel transformer architecture to improve garment transfer accuracy and temporal coherence in complex, real-world videos.
Contribution
The paper presents a new Point-Enhanced Transformer framework that guides garment transfer using sparse point alignments, overcoming limitations of mask-based methods in dynamic scenes.
Findings
Outperforms state-of-the-art methods in naturalness and coherence
Effective in complex in-the-wild scenarios
Enhances temporal stability in try-on videos
Abstract
Video Virtual Try-on aims to seamlessly transfer a reference garment onto a target person in a video while preserving both visual fidelity and temporal coherence. Existing methods typically rely on inpainting masks to define the try-on area, enabling accurate garment transfer for simple scenes (e.g., in-shop videos). However, these mask-based approaches struggle with complex real-world scenarios, as overly large and inconsistent masks often destroy spatial-temporal information, leading to distorted results. Mask-free methods alleviate this issue but face challenges in accurately determining the try-on area, especially for videos with dynamic body movements. To address these limitations, we propose PEMF-VTO, a novel Point-Enhanced Mask-Free Video Virtual Try-On framework that leverages sparse point alignments to explicitly guide garment transfer. Our key innovation is the introduction of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Advanced Image Processing Techniques
MethodsSoftmax · Attention Is All You Need · Inpainting
