AERR-Nav: Adaptive Exploration-Recovery-Reminiscing Strategy for Zero-Shot Object Navigation
Jingzhi Huang, Junkai Huang, Haoyang Yang, Haoang Li, Yi Wang

TL;DR
AERR-Nav introduces an adaptive strategy for zero-shot object navigation that dynamically balances exploration and exploitation, significantly improving performance in complex multi-floor environments.
Contribution
The paper proposes AERR-Nav, a novel framework with adaptive exploration, recovery, and reminiscing strategies, enabling better navigation in unseen multi-floor environments.
Findings
Achieves state-of-the-art results on HM3D and MP3D benchmarks.
Demonstrates effective balancing of exploration and exploitation.
Validates the proposed modules through extensive ablation studies.
Abstract
Zero-Shot Object Navigation (ZSON) in unknown multi-floor environments presents a significant challenge. Recent methods, mostly based on semantic value greedy waypoint selection, spatial topology-enhanced memory, and Multimodal Large Language Model (MLLM) as a decision-making framework, have led to improvements. However, these architectures struggle to balance exploration and exploitation for ZSON when encountering unseen environments, especially in multi-floor settings, such as robots getting stuck at narrow intersections, endlessly wandering, or failing to find stair entrances. To overcome these challenges, we propose AERR-Nav, a Zero-Shot Object Navigation framework that dynamically adjusts its state based on the robot's environment. Specifically, AERR-Nav has the following two key advantages: (1) An Adaptive Exploration-Recovery-Reminiscing Strategy, enables robots to dynamically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Multimodal Machine Learning Applications · Robotics and Sensor-Based Localization
