MV-Fashion: Towards Enabling Virtual Try-On and Size Estimation with Multi-View Paired Data

Hunor Laczk\'o; Libang Jia; Loc-Phat Truong; Diego Hern\'andez; Sergio Escalera; Jordi Gonzalez; Meysam Madadi

arXiv:2603.08147·cs.CV·March 10, 2026

MV-Fashion: Towards Enabling Virtual Try-On and Size Estimation with Multi-View Paired Data

Hunor Laczk\'o, Libang Jia, Loc-Phat Truong, Diego Hern\'andez, Sergio Escalera, Jordi Gonzalez, Meysam Madadi

PDF

Open Access

TL;DR

MV-Fashion is a comprehensive multi-view video dataset designed to advance fashion analysis tasks like virtual try-on and size estimation by providing realistic garment dynamics, detailed annotations, and paired data.

Contribution

The paper introduces MV-Fashion, a large-scale dataset with multi-view videos, detailed annotations, and paired images, specifically addressing limitations of existing datasets for fashion research.

Findings

01

Established baseline results for virtual try-on and size estimation.

02

Demonstrated the dataset's effectiveness in capturing complex garment dynamics.

03

Provided a new resource for advancing fashion-centric computer vision tasks.

Abstract

Existing 4D human datasets fall short for fashion-specific research, lacking either realistic garment dynamics or task-specific annotations. Synthetic datasets suffer from a realism gap, whereas real-world captures lack the detailed annotations and paired data required for virtual try-on (VTON) and size estimation tasks. To bridge this gap, we introduce MV-Fashion, a large-scale, multi-view video dataset engineered for domain-specific fashion analysis. MV-Fashion features 3,273 sequences (72.5 million frames) from 80 diverse subjects wearing 3-10 outfits each. It is designed to capture complex, real-world garment dynamics, including multiple layers and varied styling (e.g. rolled sleeves, tucked shirt). A core contribution is a rich data representation that includes pixel-level semantic annotations, ground-truth material properties like elasticity, and 3D point clouds. Crucially for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis · Face recognition and analysis