Hybrid Structure-from-Motion and Camera Relocalization for Enhanced   Egocentric Localization

Jinjie Mai; Abdullah Hamdi; Silvio Giancola; Chen Zhao; Bernard Ghanem

arXiv:2407.08023·cs.CV·July 12, 2024

Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization

Jinjie Mai, Abdullah Hamdi, Silvio Giancola, Chen Zhao, Bernard Ghanem

PDF

Open Access 1 Repo

TL;DR

This paper introduces EgoLoc-v1, a hybrid SfM and camera relocalization pipeline that improves egocentric localization accuracy by combining 3D scans and 2D video frame matching, outperforming previous methods.

Contribution

The paper proposes a novel ensemble strategy and hybrid approach combining SfM and 2D-3D matching for egocentric camera pose estimation, achieving state-of-the-art results.

Findings

01

Achieves 1.5% higher success rate than previous state-of-the-art.

02

Utilizes a hybrid SfM and relocalization pipeline for better pose estimation.

03

Demonstrates significant improvement in overall success rate.

Abstract

We built our pipeline EgoLoc-v1, mainly inspired by EgoLoc. We propose a model ensemble strategy to improve the camera pose estimation part of the VQ3D task, which has been proven to be essential in previous work. The core idea is not only to do SfM for egocentric videos but also to do 2D-3D matching between existing 3D scans and 2D video frames. In this way, we have a hybrid SfM and camera relocalization pipeline, which can provide us with more camera poses, leading to higher QwP and overall success rate. Our method achieves the best performance regarding the most important metric, the overall success rate. We surpass previous state-of-the-art, the competitive EgoLoc, by $1.5%$ . The code is available at \url{https://github.com/Wayne-Mai/egoloc_v1}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wayne-mai/egoloc_v1
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · 3D Surveying and Cultural Heritage