Have We Mastered Scale in Deep Monocular Visual SLAM? The ScaleMaster Dataset and Benchmark
Hyoseok Ju, Bokeon Suh, and Giseop Kim

TL;DR
This paper introduces the ScaleMaster Dataset and benchmark to evaluate and analyze the robustness of deep monocular visual SLAM systems in large-scale indoor environments, highlighting their scale inconsistency issues.
Contribution
The paper presents the first benchmark specifically designed to assess scale consistency in challenging large-scale indoor SLAM scenarios, along with a systematic analysis of current systems' vulnerabilities.
Findings
Deep SLAM systems perform well on existing benchmarks but struggle with scale in large environments.
Scale inconsistency causes significant failures in realistic indoor scenarios.
The new benchmark enables more comprehensive evaluation of scale robustness.
Abstract
Recent advances in deep monocular visual Simultaneous Localization and Mapping (SLAM) have achieved impressive accuracy and dense reconstruction capabilities, yet their robustness to scale inconsistency in large-scale indoor environments remains largely unexplored. Existing benchmarks are limited to room-scale or structurally simple settings, leaving critical issues of intra-session scale drift and inter-session scale ambiguity insufficiently addressed. To fill this gap, we introduce the ScaleMaster Dataset, the first benchmark explicitly designed to evaluate scale consistency under challenging scenarios such as multi-floor structures, long trajectories, repetitive views, and low-texture regions. We systematically analyze the vulnerability of state-of-the-art deep monocular visual SLAM systems to scale inconsistency, providing both quantitative and qualitative evaluations. Crucially,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Neural Network Applications
