TL;DR
This paper introduces a novel homography-based loss function for camera pose regression that simplifies training and improves reprojection error minimization in deep learning-based relocalization tasks.
Contribution
A new loss function based on multiplane homography integration that is easy to tune, does not require prior initialization, and outperforms existing loss functions in minimizing reprojection error.
Findings
Reduces mean square reprojection error during training
Does not require prior scene point initialization
Outperforms existing loss functions in experiments
Abstract
Some recent visual-based relocalization algorithms rely on deep learning methods to perform camera pose regression from image data. This paper focuses on the loss functions that embed the error between two poses to perform deep learning based camera pose regression. Existing loss functions are either difficult-to-tune multi-objective functions or present unstable reprojection errors that rely on ground truth 3D scene points and require a two-step training. To deal with these issues, we introduce a novel loss function which is based on a multiplane homography integration. This new function does not require prior initialization and only depends on physically interpretable hyperparameters. Furthermore, the experiments carried out on well established relocalization datasets show that it minimizes best the mean square reprojection error during training when compared with existing loss…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
