TL;DR
This paper introduces a domain-invariant feature extraction and a novel similarity mapping loss for improved retrieval-based visual localization under environmental variability, achieving state-of-the-art results.
Contribution
It proposes a probabilistic architecture for domain-invariant features, a gradient-weighted similarity activation mapping loss, and an adaptive triplet loss for enhanced contrastive learning.
Findings
Outperforms state-of-the-art in challenging environments
Demonstrates strong generalization across datasets
Effective under illumination, vegetation, and night conditions
Abstract
Visual localization is a crucial component in the application of mobile robot and autonomous driving. Image retrieval is an efficient and effective technique in image-based localization methods. Due to the drastic variability of environmental conditions, e.g. illumination, seasonal and weather changes, retrieval-based visual localization is severely affected and becomes a challenging problem. In this work, a general architecture is first formulated probabilistically to extract domain invariant feature through multi-domain image translation. And then a novel gradient-weighted similarity activation mapping loss (Grad-SAM) is incorporated for finer localization with high accuracy. We also propose a new adaptive triplet loss to boost the contrastive learning of the embedding in a self-supervised manner. The final coarse-to-fine image retrieval pipeline is implemented as the sequential…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning · Triplet Loss
