TL;DR
This paper introduces a deep learning approach to transform images into a canonical appearance to improve direct visual localization robustness under changing illumination, validated on synthetic datasets and integrated into localization pipelines.
Contribution
It proposes a novel deep convolutional encoder-decoder model for appearance transformation to enhance visual localization under illumination changes.
Findings
Improved visual odometry accuracy across varying lighting conditions.
Enhanced metric relocalization performance in challenging illumination scenarios.
Demonstrated transfer learning from synthetic to real environments.
Abstract
Direct visual localization has recently enjoyed a resurgence in popularity with the increasing availability of cheap mobile computing power. The competitive accuracy and robustness of these algorithms compared to state-of-the-art feature-based methods, as well as their natural ability to yield dense maps, makes them an appealing choice for a variety of mobile robotics applications. However, direct methods remain brittle in the face of appearance change due to their underlying assumption of photometric consistency, which is commonly violated in practice. In this paper, we propose to mitigate this problem by training deep convolutional encoder-decoder models to transform images of a scene such that they correspond to a previously-seen canonical appearance. We validate our method in multiple environments and illumination conditions using high-fidelity synthetic RGB-D datasets, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
