DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension
Roman Shapovalov, David Novotny, Benjamin Graham, Patrick Labatut,, Andrea Vedaldi

TL;DR
DensePose 3D introduces a weakly supervised method for monocular 3D reconstruction of articulated objects like humans and animals, learning from 2D annotations without requiring 3D scans, enabling broader category applicability.
Contribution
It presents a novel end-to-end approach that learns 3D reconstructions and part decompositions from 2D data, avoiding the need for large 3D datasets or parametric models.
Findings
Outperforms state-of-the-art non-rigid structure-from-motion methods.
Works effectively on both synthetic and real data.
Applicable to diverse articulated categories like animals and humans.
Abstract
We tackle the problem of monocular 3D reconstruction of articulated objects like humans and animals. We contribute DensePose 3D, a method that can learn such reconstructions in a weakly supervised fashion from 2D image annotations only. This is in stark contrast with previous deformable reconstruction methods that use parametric models such as SMPL pre-trained on a large dataset of 3D object scans. Because it does not require 3D scans, DensePose 3D can be used for learning a wide range of articulated categories such as different animal species. The method learns, in an end-to-end fashion, a soft partition of a given category-specific 3D template mesh into rigid parts together with a monocular reconstruction network that predicts the part motions such that they reproject correctly onto 2D DensePose-like surface annotations of the object. The decomposition of the object into parts is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
