APS: A Large-Scale Multi-Modal Indoor Camera Positioning System
Ali Ghofrani, Rahil Mahdian Toroghi, Seyed Mojtaba Tabatabaie

TL;DR
This paper introduces APS, a multi-modal end-to-end indoor positioning system that combines GAN-based point cloud reconstruction with CNN-based pose estimation, achieving sub-centimeter accuracy in large-scale environments.
Contribution
The paper proposes a novel multi-modal system integrating GANs and CNNs for indoor positioning, and creates a new dataset for RGB and point cloud pairs in indoor environments.
Findings
Achieves less than one centimeter positioning accuracy.
Effectively combines GAN and CNN for robust indoor localization.
Introduces a new dataset for RGB and point cloud pairs.
Abstract
Navigation inside a closed area with no GPS-signal accessibility is a highly challenging task. In order to tackle this problem, recently the imaging-based methods have grabbed the attention of many researchers. These methods either extract the features (e.g. using SIFT, or SOSNet) and map the descriptive ones to the camera position and rotation information, or deploy an end-to-end system that directly estimates this information out of RGB images, similar to PoseNet. While the former methods suffer from heavy computational burden during the test process, the latter suffers from lack of accuracy and robustness against environmental changes and object movements. However, end-to-end systems are quite fast during the test and inference and are pretty qualified for real-world applications, even though their training phase could be longer than the former ones. In this paper, a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsHuMan(Expedia)||How do I get a human at Expedia? · Concatenated Skip Connection · PatchGAN · *Communicated@Fast*How Do I Communicate to Expedia? · Convolution · Sigmoid Activation · Dropout · Batch Normalization · Pix2Pix
