Graph Attention Network for Camera Relocalization on Dynamic Scenes
Mohamed Amine Ouali, Mohamed Bouguessa, Riadh Ksantini

TL;DR
This paper introduces a graph attention network-based method for camera relocalization in dynamic scenes, leveraging scene mesh representations to improve pose estimation accuracy over previous scene-dependent models.
Contribution
It presents a novel end-to-end trainable framework combining graph neural networks and CNNs for 3D-3D matching using scene meshes, enhancing generalization to dynamic environments.
Findings
Achieves higher camera pose accuracy on RIO10 benchmark
Outperforms state-of-the-art methods in dynamic indoor relocalization
Effectively handles scene changes with mesh-based representations
Abstract
We devise a graph attention network-based approach for learning a scene triangle mesh representation in order to estimate an image camera position in a dynamic environment. Previous approaches built a scene-dependent model that explicitly or implicitly embeds the structure of the scene. They use convolution neural networks or decision trees to establish 2D/3D-3D correspondences. Such a mapping overfits the target scene and does not generalize well to dynamic changes in the environment. Our work introduces a novel approach to solve the camera relocalization problem by using the available triangle mesh. Our 3D-3D matching framework consists of three blocks: (1) a graph neural network to compute the embedding of mesh vertices, (2) a convolution neural network to compute the embedding of grid cells defined on the RGB-D image, and (3) a neural network model to establish the correspondence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Human Pose and Action Recognition
MethodsGraph Neural Network · Convolution
