S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for   Camera Relocalization

Ran Cheng; Xinyu Jiang; Yuan Chen; Lige Liu; Tao Sun

arXiv:2205.05861·cs.CV·May 13, 2022

S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization

Ran Cheng, Xinyu Jiang, Yuan Chen, Lige Liu, Tao Sun

PDF

Open Access

TL;DR

S3E-GNN introduces a novel end-to-end learning framework utilizing graph neural networks for efficient and accurate camera relocalization, outperforming traditional methods in challenging environments.

Contribution

The paper presents a new learning-based approach combining scene embedding and GNNs for robust camera relocalization, enhancing accuracy over traditional techniques.

Findings

01

Outperforms Bag-of-words in relocalization accuracy

02

Effective in challenging environments

03

Utilizes scene embedding and graph neural networks

Abstract

Camera relocalization is the key component of simultaneous localization and mapping (SLAM) systems. This paper proposes a learning-based approach, named Sparse Spatial Scene Embedding with Graph Neural Networks (S3E-GNN), as an end-to-end framework for efficient and robust camera relocalization. S3E-GNN consists of two modules. In the encoding module, a trained S3E network encodes RGB images into embedding codes to implicitly represent spatial and semantic embedding code. With embedding codes and the associated poses obtained from a SLAM system, each image is represented as a graph node in a pose graph. In the GNN query module, the pose graph is transformed to form a embedding-aggregated reference graph for camera relocalization. We collect various scene datasets in the challenging environments to perform experiments. Our results demonstrate that S3E-GNN method outperforms the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications