MV-TON: Memory-based Video Virtual Try-on network

Xiaojing Zhong; Zhonghua Wu; Taizhe Tan; Guosheng Lin; Qingyao Wu

arXiv:2108.07502·cs.CV·August 18, 2021

MV-TON: Memory-based Video Virtual Try-on network

Xiaojing Zhong, Zhonghua Wu, Taizhe Tan, Guosheng Lin, Qingyao Wu

PDF

TL;DR

MV-TON is a novel memory-based network that enables high-resolution, realistic video virtual try-on without clothing templates, advancing the quality and applicability of virtual fitting systems.

Contribution

The paper introduces MV-TON, a memory-augmented framework that improves video virtual try-on by eliminating the need for clothing templates and enhancing output resolution.

Findings

01

Outperforms existing video virtual try-on methods in quality.

02

Generates high-resolution, realistic videos.

03

Effectively transfers clothes without templates.

Abstract

With the development of Generative Adversarial Network, image-based virtual try-on methods have made great progress. However, limited work has explored the task of video-based virtual try-on while it is important in real-world applications. Most existing video-based virtual try-on methods usually require clothing templates and they can only generate blurred and low-resolution results. To address these challenges, we propose a Memory-based Video virtual Try-On Network (MV-TON), which seamlessly transfers desired clothes to a target person without using any clothing templates and generates high-resolution realistic videos. Specifically, MV-TON consists of two modules: 1) a try-on module that transfers the desired clothes from model images to frame images by pose alignment and region-wise replacing of pixels; 2) a memory refinement module that learns to embed the existing generated frames…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.