Fusing Convolutional Neural Network and Geometric Constraint for Image-based Indoor Localization
Jingwei Song, Mitesh Patel, and Maani Ghaffari

TL;DR
This paper introduces a novel indoor localization method that combines CNN-based image descriptors with geometric constraints and motion models, improving accuracy especially with limited training data.
Contribution
It presents a fusion framework integrating CNN features, geometric constraints, and motion prediction for enhanced indoor localization accuracy.
Findings
Outperforms CNN-only methods in accuracy.
Effective with small training datasets.
Works well in diffuse indoor lighting conditions.
Abstract
This paper proposes a new image-based localization framework that explicitly localizes the camera/robot by fusing Convolutional Neural Network (CNN) and sequential images' geometric constraints. The camera is localized using a single or few observed images and training images with 6-degree-of-freedom pose labels. A Siamese network structure is adopted to train an image descriptor network, and the visually similar candidate image in the training set is retrieved to localize the testing image geometrically. Meanwhile, a probabilistic motion model predicts the pose based on a constant velocity assumption. The two estimated poses are finally fused using their uncertainties to yield an accurate pose prediction. This method leverages the geometric uncertainty and is applicable in indoor scenarios predominated by diffuse illumination. Experiments on simulation and real data sets demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · 3D Surveying and Cultural Heritage
MethodsSiamese Network
