Deep Learning Features at Scale for Visual Place Recognition
Zetao Chen, Adam Jacobson, Niko Sunderhauf, Ben Upcroft, Lingqiao Liu,, Chunhua Shen, Ian Reid, Michael Milford

TL;DR
This paper trains CNNs specifically for visual place recognition at scale, using a new large dataset, and demonstrates improved performance and insights into learned features compared to generic pre-trained networks.
Contribution
It introduces the SPED dataset for large-scale training and develops a multi-scale feature encoding method for invariant place recognition.
Findings
Achieved 10% performance improvement over existing methods.
Developed a new large-scale dataset (SPED) for training.
Provided insights into network features learned for place recognition.
Abstract
The success of deep learning techniques in the computer vision domain has triggered a range of initial investigations into their utility for visual place recognition, all using generic features from networks that were trained for other types of recognition tasks. In this paper, we train, at large scale, two CNN architectures for the specific place recognition task and employ a multi-scale feature encoding method to generate condition- and viewpoint-invariant features. To enable this training to occur, we have developed a massive Specific PlacEs Dataset (SPED) with hundreds of examples of place appearance change at thousands of different places, as opposed to the semantic place type datasets currently available. This new dataset enables us to set up a training regime that interprets place recognition as a classification problem. We comprehensively evaluate our trained networks on several…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Indoor and Outdoor Localization Technologies · Advanced Image and Video Retrieval Techniques
