SPARC: Sparse Render-and-Compare for CAD model alignment in a single RGB image
Florian Langer, Gwangbin Bae, Ignas Budvytis, Roberto Cipolla

TL;DR
SPARC introduces a sparse, iterative render-and-compare method leveraging transformer architectures to improve 3D CAD model alignment from a single RGB image, outperforming previous approaches in accuracy and robustness.
Contribution
The paper presents a novel sparse, iterative render-and-compare approach using transformers for more accurate CAD model alignment from a single image, surpassing prior methods.
Findings
Achieves 31.8% instance alignment accuracy on ScanNet, outperforming previous 25.0%.
Converges after just 3 iterations, demonstrating efficiency.
Combines 2D image features with 3D CAD data for improved pose prediction.
Abstract
Estimating 3D shapes and poses of static objects from a single image has important applications for robotics, augmented reality and digital content creation. Often this is done through direct mesh predictions which produces unrealistic, overly tessellated shapes or by formulating shape prediction as a retrieval task followed by CAD model alignment. Directly predicting CAD model poses from 2D image features is difficult and inaccurate. Some works, such as ROCA, regress normalised object coordinates and use those for computing poses. While this can produce more accurate pose estimates, predicting normalised object coordinates is susceptible to systematic failure. Leveraging efficient transformer architectures we demonstrate that a sparse, iterative, render-and-compare approach is more accurate and robust than relying on normalised object coordinates. For this we combine 2D image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Anatomy and Medical Technology · Image Processing and 3D Reconstruction
