Efficient Strategy Learning by Decoupling Searching and Pathfinding for Object Navigation

Yanwei Zheng; Shaopu Feng; Bowen Huang; Chuanlin Lan; Xiao Zhang; Dongxiao Yu

arXiv:2406.14103·cs.AI·July 23, 2025·1 cites

Efficient Strategy Learning by Decoupling Searching and Pathfinding for Object Navigation

Yanwei Zheng, Shaopu Feng, Bowen Huang, Chuanlin Lan, Xiao Zhang, Dongxiao Yu

PDF

Open Access

TL;DR

This paper introduces a two-stage reward mechanism and depth-enhanced pretraining to improve object navigation by decoupling searching and pathfinding behaviors, resulting in better exploration and efficiency.

Contribution

The study proposes a novel two-stage reward mechanism and depth-aware pretraining to enhance strategy learning in object navigation tasks.

Findings

01

Outperforms state-of-the-art methods in success rate

02

Improves navigation efficiency and exploration ability

03

Effective on AI2-Thor and RoboTHOR datasets

Abstract

Inspired by human-like behaviors for navigation: first searching to explore unknown areas before discovering the target, and then the pathfinding of moving towards the discovered target, recent studies design parallel submodules to achieve different functions in the searching and pathfinding stages, while ignoring the differences in reward signals between the two stages. As a result, these models often cannot be fully trained or are overfitting on training scenes. Another bottleneck that restricts agents from learning two-stage strategies is spatial perception ability, since the studies used generic visual encoders without considering the depth information of navigation scenes. To release the potential of the model on strategy learning, we propose the Two-Stage Reward Mechanism (TSRM) for object navigation that decouples the searching and pathfinding behaviours in an episode, enabling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Robotic Path Planning Algorithms · Advanced Image and Video Retrieval Techniques