Vid2CAD: CAD Model Alignment using Multi-View Constraints from Videos

Kevis-Kokitsi Maninis; Stefan Popov; Matthias Nießner; Vittorio Ferrari

arXiv:2012.04641·cs.CV·January 26, 2022

TL;DR

This paper introduces Vid2CAD, a method that automatically aligns CAD models to videos of complex scenes containing multiple objects. It integrates per-frame neural network predictions with multi-view constraints, which resolves scale and depth ambiguities and handles occluded or out-of-view objects.

Contribution

The paper presents a temporally global, multi-view constraint optimization that combines neural network predictions from individual frames, improving CAD model alignment accuracy in videos over previous single-frame methods.

Findings

01

Substantial accuracy improvement over the single-frame method Mask2CAD (from 11.6% to 30.7%).

02

Effective handling of occlusions and out-of-view objects.

03

Automatic recovery of 9 DoF poses for multiple objects.

Abstract

We address the task of aligning CAD models to a video sequence of a complex scene containing multiple objects. Our method can process arbitrary videos and fully automatically recover the 9 DoF pose for each object appearing in it, thus aligning them in a common 3D coordinate frame. The core idea of our method is to integrate neural network predictions from individual frames with a temporally global, multi-view constraint optimization formulation. This integration process resolves the scale and depth ambiguities in the per-frame predictions, and generally improves the estimate of all pose parameters. By leveraging multi-view constraints, our method also resolves occlusions and handles objects that are out of view in individual frames, thus reconstructing all objects into a single globally consistent CAD representation of the scene. In comparison to the state-of-the-art single-frame…
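To illustrate why multi-view constraints resolve the depth ambiguity of per-frame predictions, here is a minimal sketch, not the authors' formulation. It assumes known camera intrinsics and camera-to-world poses for each frame, and a per-frame 2D estimate of an object's center. A single frame only constrains the center to lie on a viewing ray; with two or more frames, the 3D center can be recovered as the least-squares point closest to all rays. The function names `project` and `triangulate_center` are hypothetical and chosen for this example only.

```python
import numpy as np


def project(point_world, K, cam_to_world):
    """Project a 3D world point to pixel coordinates (pinhole model)."""
    R, t = cam_to_world[:3, :3], cam_to_world[:3, 3]
    p_cam = R.T @ (point_world - t)      # world -> camera coordinates
    uvw = K @ p_cam
    return uvw[:2] / uvw[2]


def triangulate_center(centers_2d, K, cam_to_world):
    """Least-squares 3D point closest to the viewing rays of all frames.

    centers_2d:   (N, 2) per-frame pixel coordinates of the object center
    K:            (3, 3) camera intrinsics (assumed shared across frames)
    cam_to_world: (N, 4, 4) camera-to-world poses (assumed known)
    """
    K_inv = np.linalg.inv(K)
    A = np.zeros((3, 3))
    b = np.zeros(3)
    for (u, v), T in zip(np.asarray(centers_2d, dtype=float), cam_to_world):
        origin = T[:3, 3]
        direction = T[:3, :3] @ (K_inv @ np.array([u, v, 1.0]))
        direction /= np.linalg.norm(direction)
        # Projector onto the plane orthogonal to the ray: it measures the
        # component of (x - origin) that is perpendicular to the ray.
        P = np.eye(3) - np.outer(direction, direction)
        A += P
        b += P @ origin
    # Normal equations of min_x sum_i ||P_i (x - origin_i)||^2
    return np.linalg.solve(A, b)


if __name__ == "__main__":
    K = np.array([[500.0, 0.0, 320.0],
                  [0.0, 500.0, 240.0],
                  [0.0, 0.0, 1.0]])
    pose_a = np.eye(4)
    pose_b = np.eye(4)
    pose_b[:3, 3] = [1.0, 0.0, 0.0]      # second camera shifted along x
    poses = np.stack([pose_a, pose_b])

    true_center = np.array([0.5, -0.2, 3.0])
    observations = np.array([project(true_center, K, T) for T in poses])
    print(triangulate_center(observations, K, poses))
```

With a single frame the matrix `A` is rank-deficient, which is the depth ambiguity mentioned in the abstract; adding a second, non-parallel viewing ray makes the system solvable. The full method estimates all 9 pose parameters per object, whereas this sketch covers only the 3D position.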

Code & Models

Repositories

likojack/odam
pytorch