Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve
Weicheng Kuo, Anelia Angelova, Tsung-Yi Lin, Angela Dai

TL;DR
Mask2CAD introduces a method that detects objects in images, retrieves the most similar 3D CAD models and poses, creating a lightweight, accurate 3D representation that bridges real-world images and synthetic models.
Contribution
The paper presents a novel joint embedding space for image regions and CAD models, enabling effective retrieval and pose estimation for 3D shape prediction from images.
Findings
Outperforms state-of-the-art on Pix3D dataset
Provides a new image-to-3D baseline with diverse shapes and occlusions
Demonstrates effective real-world 3D shape retrieval and pose estimation
Abstract
Object recognition has seen significant progress in the image domain, with focus primarily on 2D perception. We propose to leverage existing large-scale datasets of 3D models to understand the underlying 3D structure of objects seen in an image by constructing a CAD-based representation of the objects and their poses. We present Mask2CAD, which jointly detects objects in real-world images and for each detected object, optimizes for the most similar CAD model and its pose. We construct a joint embedding space between the detected regions of an image corresponding to an object and 3D CAD models, enabling retrieval of CAD models for an input RGB image. This produces a clean, lightweight representation of the objects in an image; this CAD-based representation ensures a valid, efficient shape representation for applications such as content creation or interactive scenarios, and makes a step…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Human Pose and Action Recognition
