SIM-Sync: From Certifiably Optimal Synchronization over the 3D Similarity Group to Scene Reconstruction with Learned Depth
Xihang Yu, Heng Yang

TL;DR
SIM-Sync introduces a certifiably optimal algorithm that jointly estimates camera poses and scene structure from multiview images using learned depth, bridging pose graph optimization and bundle adjustment.
Contribution
The paper proposes a novel certifiably optimal method for multiview scene reconstruction that integrates learned depth prediction with synchronization over the 3D similarity group.
Findings
SIM-Sync performs comparably to Ceres in bundle adjustment tasks.
It is robust to outliers when combined with robust estimators.
Achieves similar results to ORB-SLAM3 on real datasets.
Abstract
This paper presents SIM-Sync, a certifiably optimal algorithm that estimates camera trajectory and 3D scene structure directly from multiview image keypoints. SIM-Sync fills the gap between pose graph optimization and bundle adjustment; the former admits efficient global optimization but requires relative pose measurements and the latter directly consumes image keypoints but is difficult to optimize globally (due to camera projective geometry). The bridge to this gap is a pretrained depth prediction network. Given a graph with nodes representing monocular images taken at unknown camera poses and edges containing pairwise image keypoint correspondences, SIM-Sync first uses a pretrained depth prediction network to lift the 2D keypoints into 3D scaled point clouds, where the scaling of the per-image point cloud is unknown due to the scale ambiguity in monocular depth prediction. SIM-Sync…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Robotics and Sensor-Based Localization
