MV-Fashion: Towards Enabling Virtual Try-On and Size Estimation with Multi-View Paired Data
Hunor Laczk\'o, Libang Jia, Loc-Phat Truong, Diego Hern\'andez, Sergio Escalera, Jordi Gonzalez, Meysam Madadi

TL;DR
MV-Fashion is a comprehensive multi-view video dataset designed to advance fashion analysis tasks like virtual try-on and size estimation by providing realistic garment dynamics, detailed annotations, and paired data.
Contribution
The paper introduces MV-Fashion, a large-scale dataset with multi-view videos, detailed annotations, and paired images, specifically addressing limitations of existing datasets for fashion research.
Findings
Established baseline results for virtual try-on and size estimation.
Demonstrated the dataset's effectiveness in capturing complex garment dynamics.
Provided a new resource for advancing fashion-centric computer vision tasks.
Abstract
Existing 4D human datasets fall short for fashion-specific research, lacking either realistic garment dynamics or task-specific annotations. Synthetic datasets suffer from a realism gap, whereas real-world captures lack the detailed annotations and paired data required for virtual try-on (VTON) and size estimation tasks. To bridge this gap, we introduce MV-Fashion, a large-scale, multi-view video dataset engineered for domain-specific fashion analysis. MV-Fashion features 3,273 sequences (72.5 million frames) from 80 diverse subjects wearing 3-10 outfits each. It is designed to capture complex, real-world garment dynamics, including multiple layers and varied styling (e.g. rolled sleeves, tucked shirt). A core contribution is a rich data representation that includes pixel-level semantic annotations, ground-truth material properties like elasticity, and 3D point clouds. Crucially for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis · Face recognition and analysis
