TL;DR
This paper introduces STA-VPR, a novel spatio-temporal alignment method for visual place recognition that enhances CNN-based features' robustness to viewpoint and appearance changes without re-training.
Contribution
It proposes an adaptive DTW algorithm for spatial feature alignment and a local matching DTW for temporal sequence matching, improving VPR accuracy and efficiency.
Findings
Significantly improves CNN-based VPR performance.
Outperforms several state-of-the-art methods.
Maintains good run-time performance.
Abstract
Recently, the methods based on Convolutional Neural Networks (CNNs) have gained popularity in the field of visual place recognition (VPR). In particular, the features from the middle layers of CNNs are more robust to drastic appearance changes than handcrafted features and high-layer features. Unfortunately, the holistic mid-layer features lack robustness to large viewpoint changes. Here we split the holistic mid-layer features into local features, and propose an adaptive dynamic time warping (DTW) algorithm to align local features from the spatial domain while measuring the distance between two images. This realizes viewpoint-invariant and condition-invariant place recognition. Meanwhile, a local matching DTW (LM-DTW) algorithm is applied to perform image sequence matching based on temporal alignment, which achieves further improvements and ensures linear time complexity. We perform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDynamic Time Warping
