Exploring Object-Aware Attention Guided Frame Association for RGB-D SLAM

Ali Caglayan; Nevrez Imamoglu; Oguzhan Guclu; Ali Osman Serhatoglu; Ahmet Burak Can; Ryosuke Nakamura

arXiv:2510.26131·cs.CV·October 31, 2025

Exploring Object-Aware Attention Guided Frame Association for RGB-D SLAM

Ali Caglayan, Nevrez Imamoglu, Oguzhan Guclu, Ali Osman Serhatoglu, Ahmet Burak Can, Ryosuke Nakamura

PDF

TL;DR

This paper introduces a novel RGB-D SLAM method that leverages gradient-based, object-aware attention within CNN features to enhance frame association, especially in large indoor environments.

Contribution

It presents a new approach integrating layer-wise attention from network gradients into CNN features for improved SLAM performance.

Findings

01

Enhanced frame association accuracy in large environments

02

Improved SLAM robustness through attention-guided features

03

Superior performance over baseline methods

Abstract

Attention models have recently emerged as a powerful approach, demonstrating significant progress in various fields. Visualization techniques, such as class activation mapping, provide visual insights into the reasoning of convolutional neural networks (CNNs). Using network gradients, it is possible to identify regions where the network pays attention during image recognition tasks. Furthermore, these gradients can be combined with CNN features to localize more generalizable, task-specific attentive (salient) regions within scenes. However, explicit use of this gradient-based attention information integrated directly into CNN representations for semantic object understanding remains limited. Such integration is particularly beneficial for visual tasks like simultaneous localization and mapping (SLAM), where CNN representations enriched with spatially attentive object locations can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.