Relative Pose Regression with Pose Auto-Encoders: Enhancing Accuracy and Data Efficiency for Retail Applications
Yoli Shavit, Yosi Keller

TL;DR
This paper introduces a novel PAE-based relative pose regression method that refines absolute camera localization, significantly improving accuracy and data efficiency for retail applications without extra storage requirements.
Contribution
It extends Pose Auto-Encoders to relative pose regression and proposes a refinement scheme that enhances localization accuracy with less training data.
Findings
PAE-based RPR outperforms image-based RPR models of similar architecture.
Refinement with PAE-based RPR improves indoor localization accuracy.
Achieves competitive results using only 30% of training data.
Abstract
Accurate camera localization is crucial for modern retail environments, enabling enhanced customer experiences, streamlined inventory management, and autonomous operations. While Absolute Pose Regression (APR) from a single image offers a promising solution, approaches that incorporate visual and spatial scene priors tend to achieve higher accuracy. Camera Pose Auto-Encoders (PAEs) have recently been introduced to embed such priors into APR. In this work, we extend PAEs to the task of Relative Pose Regression (RPR) and propose a novel re-localization scheme that refines APR predictions using PAE-based RPR, without requiring additional storage of images or pose data. We first introduce PAE-based RPR and establish its effectiveness by comparing it with image-based RPR models of equivalent architectures. We then demonstrate that our refinement strategy, driven by a PAE-based RPR, enhances…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
