TL;DR
C3DPO introduces a deep learning approach for reconstructing 3D models of deformable objects from 2D keypoints in unconstrained images, effectively handling occlusions and viewpoint variations without requiring ground-truth 3D data.
Contribution
It presents a novel regularization technique and a canonicalization function for factorizing shape, viewpoint, and deformation in 3D reconstruction from single images.
Findings
Achieves state-of-the-art results on Up3D and PASCAL3D+ benchmarks.
Does not rely on ground-truth 3D supervision.
Successfully handles partial occlusions and viewpoint changes.
Abstract
We propose C3DPO, a method for extracting 3D models of deformable objects from 2D keypoint annotations in unconstrained images. We do so by learning a deep network that reconstructs a 3D object from a single view at a time, accounting for partial occlusions, and explicitly factoring the effects of viewpoint changes and object deformations. In order to achieve this factorization, we introduce a novel regularization technique. We first show that the factorization is successful if, and only if, there exists a certain canonicalization function of the reconstructed shapes. Then, we learn the canonicalization function together with the reconstruction one, which constrains the result to be consistent. We demonstrate state-of-the-art reconstruction results for methods that do not use ground-truth 3D supervision for a number of benchmarks, including Up3D and PASCAL3D+. Source code has been made…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
