Scene Representation using 360{\deg} Saliency Graph and its Application in Vision-based Indoor Navigation
Preeti Meena, Himanshu Kumar, Sandeep Yadav

TL;DR
This paper introduces a novel 360-degree saliency graph for scene representation that improves indoor navigation by explicitly encoding visual, semantic, and geometric information, demonstrating robustness and enhanced localization.
Contribution
It proposes a new 360-degree saliency graph representation that explicitly encodes scene information, improving robustness and efficiency for vision-based indoor navigation.
Findings
Enhanced scene localization accuracy
Improved navigation performance in indoor environments
Robustness against illumination and occlusion challenges
Abstract
A Scene, represented visually using different formats such as RGB-D, LiDAR scan, keypoints, rectangular, spherical, multi-views, etc., contains information implicitly embedded relevant to applications such as scene indexing, vision-based navigation. Thus, these representations may not be efficient for such applications. This paper proposes a novel 360{\deg} saliency graph representation of the scenes. This rich representation explicitly encodes the relevant visual, contextual, semantic, and geometric information of the scene as nodes, edges, edge weights, and angular position in the 360{\deg} graph. Also, this representation is robust against scene view change and addresses challenges of indoor environments such as varied illumination, occlusions, and shadows as in the case of existing traditional methods. We have utilized this rich and efficient representation for vision-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Visual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques
