TL;DR
This paper introduces a generalized contrastive loss for training Siamese networks with continuous similarity measures, improving place recognition performance and reducing training complexity.
Contribution
It proposes a novel GCL function for continuous similarity, along with automatic annotation techniques, enhancing CNN training and outperforming state-of-the-art methods.
Findings
Outperforms existing methods like NetVLAD and Patch-NetVLAD.
Generalizes well across multiple datasets.
Eliminates the need for complex pair mining during training.
Abstract
Visual place recognition is a challenging task in computer vision and a key component of camera-based localization and navigation systems. Recently, Convolutional Neural Networks (CNNs) achieved high results and good generalization capabilities. They are usually trained using pairs or triplets of images labeled as either similar or dissimilar, in a binary fashion. In practice, the similarity between two images is not binary, but continuous. Furthermore, training these CNNs is computationally complex and involves costly pair and triplet mining strategies. We propose a Generalized Contrastive loss (GCL) function that relies on image similarity as a continuous measure, and use it to train a siamese CNN. Furthermore, we present three techniques for automatic annotation of image pairs with labels indicating their degree of similarity, and deploy them to re-annotate the MSLS, TB-Places, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSiamese Network
