SR-Nav: Spatial Relationships Matter for Zero-shot Object Goal Navigation
Leyuan Fang, Zan Mao, Zijing Wang, Yinlong Yan

TL;DR
SR-Nav introduces a spatial relationship graph and modules that leverage object relationships to improve zero-shot object goal navigation, especially under partial observations, achieving state-of-the-art results.
Contribution
The paper proposes a novel framework that models spatial relationships among objects to enhance perception and planning in zero-shot navigation tasks.
Findings
Achieves state-of-the-art success rate on HM3D dataset.
Improves navigation efficiency by reducing exploration redundancy.
Enhances perception robustness through relationship matching.
Abstract
Zero-shot object-goal navigation aims to find target objects in unseen environments using only egocentric observation. Recent methods leverage foundation models' comprehension and reasoning capabilities to enhance navigation performance. However, when faced with poor viewpoints or weak semantic cues, foundation models often fail to support reliable reasoning in both perception and planning, resulting in inefficient or failed navigation. We observe that inherent relationships among objects and regions encode structured scene priors, which help agents infer plausible target locations even under partial observations. Motivated by this insight, we propose Spatial Relation-aware Navigation (SR-Nav), a framework that models both observed and experience-based spatial relationships to enhance both perception and planning. Specifically, SR-Nav first constructs a Dynamic Spatial Relationship…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robotic Path Planning Algorithms · Robotics and Sensor-Based Localization
