TL;DR
This paper introduces a domain-invariant feature learning method for visual localization that improves image retrieval accuracy across changing environments by using a self-supervised training approach with ComboGAN.
Contribution
The paper proposes a novel domain-invariant feature learning approach using ComboGAN and feature consistency loss, enhancing retrieval-based localization under varying environmental conditions.
Findings
Outperforms state-of-the-art descriptors on CMU-Seasons dataset
Effective in high and medium precision localization scenarios
Enables robust image retrieval across different environmental domains
Abstract
Visual localization is a crucial problem in mobile robotics and autonomous driving. One solution is to retrieve images with known pose from a database for the localization of query images. However, in environments with drastically varying conditions (e.g. illumination changes, seasons, occlusion, dynamic objects), retrieval-based localization is severely hampered and becomes a challenging problem. In this paper, a novel domain-invariant feature learning method (DIFL) is proposed based on ComboGAN, a multi-domain image translation network architecture. By introducing a feature consistency loss (FCL) between the encoded features of the original image and translated image in another domain, we are able to train the encoders to generate domain-invariant features in a self-supervised manner. To retrieve a target image from the database, the query image is first encoded using the encoder…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
