Efficient Strategy Learning by Decoupling Searching and Pathfinding for Object Navigation
Yanwei Zheng, Shaopu Feng, Bowen Huang, Chuanlin Lan, Xiao Zhang, Dongxiao Yu

TL;DR
This paper introduces a two-stage reward mechanism and depth-enhanced pretraining to improve object navigation by decoupling searching and pathfinding behaviors, resulting in better exploration and efficiency.
Contribution
The study proposes a novel two-stage reward mechanism and depth-aware pretraining to enhance strategy learning in object navigation tasks.
Findings
Outperforms state-of-the-art methods in success rate
Improves navigation efficiency and exploration ability
Effective on AI2-Thor and RoboTHOR datasets
Abstract
Inspired by human-like behaviors for navigation: first searching to explore unknown areas before discovering the target, and then the pathfinding of moving towards the discovered target, recent studies design parallel submodules to achieve different functions in the searching and pathfinding stages, while ignoring the differences in reward signals between the two stages. As a result, these models often cannot be fully trained or are overfitting on training scenes. Another bottleneck that restricts agents from learning two-stage strategies is spatial perception ability, since the studies used generic visual encoders without considering the depth information of navigation scenes. To release the potential of the model on strategy learning, we propose the Two-Stage Reward Mechanism (TSRM) for object navigation that decouples the searching and pathfinding behaviours in an episode, enabling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Robotic Path Planning Algorithms · Advanced Image and Video Retrieval Techniques
