Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments
Felix Ott, Lucas Heublein, David R\"ugamer, Bernd Bischl, Christopher, Mutschler

TL;DR
This paper introduces a recurrent fusion network approach to improve indoor object localization by combining structure from motion and simulation-augmented pose regression, addressing environmental challenges like motion blur and lighting changes.
Contribution
It proposes a novel recurrent fusion network for better alignment of absolute and relative poses, trained in simulation and tested in real-world warehouse scenarios.
Findings
Recurrent fusion networks outperform pose graph optimization in pose accuracy.
Simulation-based pre-training enhances generalization to real environments.
The method effectively handles challenging indoor conditions.
Abstract
The localization of objects is a crucial task in various applications such as robotics, virtual and augmented reality, and the transportation of goods in warehouses. Recent advances in deep learning have enabled the localization using monocular visual cameras. While structure from motion (SfM) predicts the absolute pose from a point cloud, absolute pose regression (APR) methods learn a semantic understanding of the environment through neural networks. However, both fields face challenges caused by the environment such as motion blur, lighting changes, repetitive patterns, and feature-less structures. This study aims to address these challenges by incorporating additional information and regularizing the absolute pose using relative pose regression (RPR) methods. RPR methods suffer under different challenges, i.e., motion blur. The optical flow between consecutive images is computed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · 3D Surveying and Cultural Heritage · Advanced Vision and Imaging
MethodsALIGN
