DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to   the Third Dimension

Roman Shapovalov; David Novotny; Benjamin Graham; Patrick Labatut,; Andrea Vedaldi

arXiv:2109.00033·cs.CV·September 2, 2021

DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension

Roman Shapovalov, David Novotny, Benjamin Graham, Patrick Labatut,, Andrea Vedaldi

PDF

Open Access

TL;DR

DensePose 3D introduces a weakly supervised method for monocular 3D reconstruction of articulated objects like humans and animals, learning from 2D annotations without requiring 3D scans, enabling broader category applicability.

Contribution

It presents a novel end-to-end approach that learns 3D reconstructions and part decompositions from 2D data, avoiding the need for large 3D datasets or parametric models.

Findings

01

Outperforms state-of-the-art non-rigid structure-from-motion methods.

02

Works effectively on both synthetic and real data.

03

Applicable to diverse articulated categories like animals and humans.

Abstract

We tackle the problem of monocular 3D reconstruction of articulated objects like humans and animals. We contribute DensePose 3D, a method that can learn such reconstructions in a weakly supervised fashion from 2D image annotations only. This is in stark contrast with previous deformable reconstruction methods that use parametric models such as SMPL pre-trained on a large dataset of 3D object scans. Because it does not require 3D scans, DensePose 3D can be used for learning a wide range of articulated categories such as different animal species. The method learns, in an end-to-end fashion, a soft partition of a given category-specific 3D template mesh into rigid parts together with a monocular reconstruction network that predicts the part motions such that they reproject correctly onto 2D DensePose-like surface annotations of the object. The decomposition of the object into parts is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications