Learning monocular 3D reconstruction of articulated categories from   motion

Filippos Kokkinos; Iasonas Kokkinos

arXiv:2103.16352·cs.CV·April 28, 2021

Learning monocular 3D reconstruction of articulated categories from motion

Filippos Kokkinos, Iasonas Kokkinos

PDF

TL;DR

This paper presents a novel self-supervised approach for monocular 3D reconstruction of articulated objects using motion consistency, an interpretable deformation model, and joint optimization, achieving state-of-the-art results with minimal supervision.

Contribution

It introduces a motion-based cycle loss for self-supervision, an interpretable mesh deformation model with local handles, and a joint optimization method for improved 3D reconstruction accuracy.

Findings

01

Achieves state-of-the-art 3D reconstructions for multiple categories.

02

Uses minimal video data for supervision, reducing data requirements.

03

Enhances reconstruction accuracy through joint mesh and camera optimization.

Abstract

Monocular 3D reconstruction of articulated object categories is challenging due to the lack of training data and the inherent ill-posedness of the problem. In this work we use video self-supervision, forcing the consistency of consecutive 3D reconstructions by a motion-based cycle loss. This largely improves both optimization-based and learning-based 3D mesh reconstruction. We further introduce an interpretable model of 3D template deformations that controls a 3D surface through the displacement of a small number of local, learnable handles. We formulate this operation as a structured layer relying on mesh-laplacian regularization and show that it can be trained in an end-to-end manner. We finally introduce a per-sample numerical optimisation approach that jointly optimises over mesh displacements and cameras within a video, boosting accuracy both for training and also as test time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.