Marr Revisited: 2D-3D Alignment via Surface Normal Prediction
Aayush Bansal, Bryan Russell, Abhinav Gupta

TL;DR
This paper presents a novel method for 3D model retrieval from 2D images by predicting surface normals with a deep learning model, enabling improved pose and style matching with CAD models.
Contribution
It introduces a skip-network CNN for accurate surface normal prediction and a two-stream network that jointly learns pose and style for enhanced CAD model retrieval.
Findings
Achieves state-of-the-art surface normal prediction accuracy on NYUv2 dataset.
Matches prior work in pose prediction when using predicted normals.
Provides better style and pose matching for CAD retrieval compared to baselines.
Abstract
We introduce an approach that leverages surface normal predictions, along with appearance cues, to retrieve 3D models for objects depicted in 2D still images from a large CAD object library. Critical to the success of our approach is the ability to recover accurate surface normals for objects in the depicted scene. We introduce a skip-network model built on the pre-trained Oxford VGG convolutional neural network (CNN) for surface normal prediction. Our model achieves state-of-the-art accuracy on the NYUv2 RGB-D dataset for surface normal prediction, and recovers fine object detail compared to previous methods. Furthermore, we develop a two-stream network over the input image and predicted surface normals that jointly learns pose and style for CAD model retrieval. When using the predicted surface normals, our two-stream network matches prior work using surface normals computed from RGB-D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
