Hierarchical place recognition with omnidirectional images and curriculum learning-based loss functions
Marcos Alfaro, Juan Jos\'e Cabrera, Mar\'ia Flores, \'Oscar Reinoso, Luis Pay\'a

TL;DR
This paper introduces a hierarchical visual place recognition method using panoramic images and curriculum learning-based triplet loss functions, improving robustness and accuracy in challenging environments for mobile robots.
Contribution
It proposes a novel curriculum learning-based triplet loss for deep models using panoramic images, enhancing discriminative feature learning for place recognition.
Findings
Outperforms standard contrastive loss in challenging conditions
Effective in indoor and outdoor environments with illumination changes
Robust against noise, occlusions, and limited training data
Abstract
This paper addresses Visual Place Recognition (VPR), which is essential for the safe navigation of mobile robots. The solution we propose employs panoramic images and deep learning models, which are fine-tuned with triplet loss functions that integrate curriculum learning strategies. By progressively presenting more challenging examples during training, these loss functions enable the model to learn more discriminative and robust feature representations, overcoming the limitations of conventional contrastive loss functions. After training, VPR is tackled in two steps: coarse (room retrieval) and fine (position estimation). The results demonstrate that the curriculum-based triplet losses consistently outperform standard contrastive loss functions, particularly under challenging perceptual conditions. To thoroughly assess the robustness and generalization capabilities of the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · 3D Surveying and Cultural Heritage
MethodsTriplet Loss
