HiRO-Nav: Hybrid ReasOning Enables Efficient Embodied Navigation

He Zhao; Yijun Yang; Zichuan Lin; Deheng Ye; Chunyan Miao

arXiv:2604.08232·cs.AI·April 10, 2026

HiRO-Nav: Hybrid ReasOning Enables Efficient Embodied Navigation

He Zhao, Yijun Yang, Zichuan Lin, Deheng Ye, Chunyan Miao

PDF

TL;DR

HiRO-Nav introduces an adaptive reasoning navigation agent that selectively engages in deliberative reasoning based on action entropy, improving efficiency and success in embodied navigation tasks.

Contribution

The paper presents HiRO-Nav, a novel agent that dynamically determines when to reason during navigation, combining hybrid training and reasoning strategies for enhanced performance.

Findings

01

High-entropy actions often lead to novel scenes or objects.

02

Improving high-entropy actions correlates with better task success.

03

HiRO-Nav outperforms baselines in success rate and token efficiency.

Abstract

Embodied navigation agents built upon large reasoning models (LRMs) can handle complex, multimodal environmental input and perform grounded reasoning per step to improve sequential decision-making for long-horizon tasks. However, a critical question remains: \textit{how can the reasoning capabilities of LRMs be harnessed intelligently and efficiently for long-horizon navigation tasks?} In simple scenes, agents are expected to act reflexively, while in complex ones they should engage in deliberate reasoning before acting.To achieve this, we introduce \textbf{H}ybr\textbf{i}d \textbf{R}eas\textbf{O}ning \textbf{Nav}igation (\textbf{HiRO-Nav}) agent, the first kind of agent capable of adaptively determining whether to perform thinking at every step based on its own action entropy. Specifically, by examining how the agent's action entropy evolves over the navigation trajectories, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.