TL;DR
This paper introduces a causality-aware self-supervised method for reconstructing 3D clothing from a single image without needing 3D ground-truth data, addressing key challenges in modeling non-rigid objects.
Contribution
It proposes a novel explainable causal map and optimization framework that disentangles shape, texture, camera, and illumination in 3D clothing reconstruction.
Findings
Achieves high-fidelity 3D clothing reconstruction on fashion benchmarks
Demonstrates scalability on a bird dataset
Outperforms existing methods in handling non-rigid objects
Abstract
This research aims to study a self-supervised 3D clothing reconstruction method, which recovers the geometry shape and texture of human clothing from a single image. Compared with existing methods, we observe that three primary challenges remain: (1) 3D ground-truth meshes of clothing are usually inaccessible due to annotation difficulties and time costs; (2) Conventional template-based methods are limited to modeling non-rigid objects, e.g., handbags and dresses, which are common in fashion images; (3) The inherent ambiguity compromises the model training, such as the dilemma between a large shape with a remote camera or a small shape with a close camera. In an attempt to address the above limitations, we propose a causality-aware self-supervised learning method to adaptively reconstruct 3D non-rigid objects from 2D images without 3D annotations. In particular, to solve the inherent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
