Exploring Object-Aware Attention Guided Frame Association for RGB-D SLAM
Ali Caglayan, Nevrez Imamoglu, Oguzhan Guclu, Ali Osman Serhatoglu,, Weimin Wang, Ahmet Burak Can, Ryosuke Nakamura

TL;DR
This paper introduces a novel approach that leverages task-specific network attention, derived from gradients, to enhance frame association in RGB-D indoor SLAM, leading to improved localization accuracy.
Contribution
It proposes integrating layer-wise attention information with CNN features to improve SLAM performance, a novel use of gradient-based attention in this context.
Findings
Improved frame association accuracy in RGB-D SLAM
Enhanced localization performance over baseline methods
Effective use of gradient-based attention for object-aware SLAM
Abstract
Deep learning models as an emerging topic have shown great progress in various fields. Especially, visualization tools such as class activation mapping methods provided visual explanation on the reasoning of convolutional neural networks (CNNs). By using the gradients of the network layers, it is possible to demonstrate where the networks pay attention during a specific image recognition task. Moreover, these gradients can be integrated with CNN features for localizing more generalized task dependent attentive (salient) objects in scenes. Despite this progress, there is not much explicit usage of this gradient (network attention) information to integrate with CNN representations for object semantics. This can be very useful for visual tasks such as simultaneous localization and mapping (SLAM) where CNN representations of spatially attentive object locations may lead to improved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques
