Visual SLAM with Graph-Cut Optimized Multi-Plane Reconstruction
Fangwen Shu, Yaxu Xie, Jason Rambach, Alain Pagani, Didier Stricker

TL;DR
This paper introduces a semantic planar SLAM system that enhances pose estimation and mapping from monocular images by using graph-cut optimization for geometric model fitting and an adaptive parameter strategy, addressing challenges in scene scale and data association.
Contribution
It proposes a novel graph-cut based approach for homography and plane estimation in monocular SLAM, improving robustness over traditional RANSAC methods.
Findings
Enhanced pose accuracy demonstrated on multiple datasets.
Robust geometric model fitting with graph-cut optimization.
Effective adaptive parameter strategy improves system stability.
Abstract
This paper presents a semantic planar SLAM system that improves pose estimation and mapping using cues from an instance planar segmentation network. While the mainstream approaches are using RGB-D sensors, employing a monocular camera with such a system still faces challenges such as robust data association and precise geometric model fitting. In the majority of existing work, geometric model estimation problems such as homography estimation and piece-wise planar reconstruction (PPR) are usually solved by standard (greedy) RANSAC separately and sequentially. However, setting the inlier-outlier threshold is difficult in absence of information about the scene (i.e. the scale). In this work, we revisit these problems and argue that two mentioned geometric models (homographies/3D planes) can be solved by minimizing an energy function that exploits the spatial coherence, i.e. with graph-cut…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · 3D Surveying and Cultural Heritage · Advanced Image and Video Retrieval Techniques
