# Visual place recognition with panoramic images using hybrid neural network models

**Authors:** Lars Offermann

PMC · DOI: 10.1038/s41598-025-34473-7 · Scientific Reports · 2026-01-08

## TL;DR

This paper introduces hybrid neural network models for visual place recognition in robots, improving performance under changing conditions like lighting and camera tilt.

## Contribution

The novel hybrid models combine CNNs with algorithmic VPR methods, trained end-to-end for improved robustness.

## Key findings

- Hybrid Visual Compass improves tilt tolerance in visual place recognition.
- Hybrid MinWarping is robust against illumination changes and object rearrangement.
- Hybrid models outperform sparse feature-based methods for upright images.

## Abstract

A mobile robot can localize itself in a mapped area by finding a recorded image of a visited place that is most similar to the current view, a technique known as Visual Place Recognition (VPR). We focus on VPR with panoramic images in indoor environments and on direct VPR methods in contrast to feature-based methods. In this context, a key challenge of VPR are appearance changes in the environment, e.g. due to variations in illumination, camera tilt, and rearrangement of objects. To improve the quality in these situations, we propose a novel combination of convolutional neural networks (CNNs) for image preprocessing with two algorithmic solutions for VPR, the Visual Compass and MinWarping. Here, the CNN is fused with the algorithmic VPR method such that the training of the neural network includes backpropagation through both parts, which we refer to as a hybrid model. We show that the hybrid Visual Compass substantially improves tilt tolerance, resulting in a versatile model, while hybrid MinWarping is especially robust against illumination changes and object rearrangement. As an adjacent application to VPR, the hybrid MinWarping algorithm can also be used to estimate the relative pose of the query with respect to a previous image. We analyze how the network solution to image processing changes fundamentally to satisfy the unique requirements of each application. We also show that the VPR hybrid models compare favorably for upright images with a competing solution based on sparse local features.

## Full-text entities

- **Diseases:** RPE (MESH:D000080822)
- **Chemicals:** VPR (-)
- **Species:** Actinopterygii (fishes, superclass) [taxon 7898]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12791132/full.md

## Figures

20 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12791132/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/PMC12791132/full.md

---
Source: https://tomesphere.com/paper/PMC12791132