TL;DR
This paper introduces a novel Differentiable Semantic Rendering loss that leverages clothing semantics to improve 3D human body shape and pose estimation from monocular images, especially in clothed scenarios.
Contribution
It proposes a new DSR loss with clothing-aware semantics and learns a clothing prior for SMPL vertices, enhancing 3D human reconstruction accuracy.
Findings
Outperforms previous state-of-the-art on 3DPW and Human3.6M datasets.
Achieves comparable results on MPI-INF-3DHP.
Demonstrates the effectiveness of clothing semantics in 3D human pose and shape estimation.
Abstract
Learning to regress 3D human body shape and pose (e.g.~SMPL parameters) from monocular images typically exploits losses on 2D keypoints, silhouettes, and/or part-segmentation when 3D training data is not available. Such losses, however, are limited because 2D keypoints do not supervise body shape and segmentations of people in clothing do not match projected minimally-clothed SMPL shapes. To exploit richer image information about clothed people, we introduce higher-level semantic information about clothing to penalize clothed and non-clothed regions of the image differently. To do so, we train a body regressor using a novel Differentiable Semantic Rendering - DSR loss. For Minimally-Clothed regions, we define the DSR-MC loss, which encourages a tight match between a rendered SMPL body and the minimally-clothed regions of the image. For clothed regions, we define the DSR-C loss to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
