TL;DR
ManhattanSLAM is a robust RGB-D SLAM system that accurately tracks and maps indoor scenes by leveraging a novel mixture of Manhattan Frames, improving performance in both MW and non-MW environments.
Contribution
The paper introduces a new approach that detects Manhattan Frames in scenes, decouples pose estimation in MW scenes, and combines features for robust tracking and dense mapping.
Findings
Outperforms state-of-the-art methods in pose estimation accuracy.
Achieves low-drift tracking in diverse indoor environments.
Provides efficient dense mapping using plane and non-plane features.
Abstract
In this paper, a robust RGB-D SLAM system is proposed to utilize the structural information in indoor scenes, allowing for accurate tracking and efficient dense mapping on a CPU. Prior works have used the Manhattan World (MW) assumption to estimate low-drift camera pose, in turn limiting the applications of such systems. This paper, in contrast, proposes a novel approach delivering robust tracking in MW and non-MW environments. We check orthogonal relations between planes to directly detect Manhattan Frames, modeling the scene as a Mixture of Manhattan Frames. For MW scenes, we decouple pose estimation and provide a novel drift-free rotation estimation based on Manhattan Frame observations. For translation estimation in MW scenes and full camera pose estimation in non-MW scenes, we make use of point, line and plane features for robust tracking in challenging scenes. %mapping…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
