Spatio-Temporal Garment Reconstruction Using Diffusion Mapping via Pattern Coordinates

Yingxuan You; Ren Li; Corentin Dumery; Cong Cao; Hao Li; Pascal Fua

arXiv:2602.24043·cs.CV·March 2, 2026

Spatio-Temporal Garment Reconstruction Using Diffusion Mapping via Pattern Coordinates

Yingxuan You, Ren Li, Corentin Dumery, Cong Cao, Hao Li, Pascal Fua

PDF

Open Access

TL;DR

This paper introduces a novel spatio-temporal diffusion framework for high-fidelity 3D garment reconstruction from monocular images and videos, effectively capturing detailed and dynamic clothing geometry.

Contribution

It combines implicit sewing patterns with diffusion models and a mapping approach to improve garment reconstruction accuracy and temporal consistency, especially for loose-fitting clothing.

Findings

01

Outperforms existing methods on various garment types.

02

Generalizes well from synthetic training to real-world data.

03

Preserves fine geometric details and realistic motion.

Abstract

Reconstructing 3D clothed humans from monocular images and videos is a fundamental problem with applications in virtual try-on, avatar creation, and mixed reality. Despite significant progress in human body recovery, accurately reconstructing garment geometry, particularly for loose-fitting clothing, remains an open challenge. We propose a unified framework for high-fidelity 3D garment reconstruction from both single images and video sequences. Our approach combines Implicit Sewing Patterns (ISP) with a generative diffusion model to learn expressive garment shape priors in 2D UV space. Leveraging these priors, we introduce a mapping model that establishes correspondences between image pixels, UV pattern coordinates, and 3D geometry, enabling accurate and detailed garment reconstruction from single images. We further extend this formulation to dynamic reconstruction by introducing a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Face recognition and analysis