Estimation of Absolute Scale in Monocular SLAM Using Synthetic Data
Danila Rukhovich, Daniel Mouritzen, Ralf Kaestner, Martin Rufli,, Alexander Velizhev

TL;DR
This paper introduces a neural network-based method for estimating absolute scale in monocular SLAM using only synthetic data for training, achieving accuracy comparable to real data training and enabling real-time applications.
Contribution
It demonstrates that synthetic data can effectively train scale estimation networks for monocular SLAM, reducing reliance on real annotated datasets.
Findings
Synthetic training data yields comparable accuracy to real data.
The method operates effectively on low-resolution images.
Unsupervised domain adaptation mitigates appearance differences.
Abstract
This paper addresses the problem of scale estimation in monocular SLAM by estimating absolute distances between camera centers of consecutive image frames. These estimates would improve the overall performance of classical (not deep) SLAM systems and allow metric feature locations to be recovered from a single monocular camera. We propose several network architectures that lead to an improvement of scale estimation accuracy over the state of the art. In addition, we exploit a possibility to train the neural network only with synthetic data derived from a computer graphics simulator. Our key insight is that, using only synthetic training inputs, we can achieve similar scale estimation accuracy as that obtained from real data. This fact indicates that fully annotated simulated data is a viable alternative to existing deep-learning-based SLAM systems trained on real (unlabeled) data. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
