HiRO-Nav: Hybrid ReasOning Enables Efficient Embodied Navigation
He Zhao, Yijun Yang, Zichuan Lin, Deheng Ye, Chunyan Miao

TL;DR
HiRO-Nav introduces an adaptive reasoning navigation agent that selectively engages in deliberative reasoning based on action entropy, improving efficiency and success in embodied navigation tasks.
Contribution
The paper presents HiRO-Nav, a novel agent that dynamically determines when to reason during navigation, combining hybrid training and reasoning strategies for enhanced performance.
Findings
High-entropy actions often lead to novel scenes or objects.
Improving high-entropy actions correlates with better task success.
HiRO-Nav outperforms baselines in success rate and token efficiency.
Abstract
Embodied navigation agents built upon large reasoning models (LRMs) can handle complex, multimodal environmental input and perform grounded reasoning per step to improve sequential decision-making for long-horizon tasks. However, a critical question remains: \textit{how can the reasoning capabilities of LRMs be harnessed intelligently and efficiently for long-horizon navigation tasks?} In simple scenes, agents are expected to act reflexively, while in complex ones they should engage in deliberate reasoning before acting.To achieve this, we introduce \textbf{H}ybr\textbf{i}d \textbf{R}eas\textbf{O}ning \textbf{Nav}igation (\textbf{HiRO-Nav}) agent, the first kind of agent capable of adaptively determining whether to perform thinking at every step based on its own action entropy. Specifically, by examining how the agent's action entropy evolves over the navigation trajectories, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
