Global Commander and Local Operative: A Dual-Agent Framework for Scene Navigation
Kaiming Jin, Yuefan Wu, Shengqiong Wu, Bobo Li, Shuicheng Yan, Tat-Seng Chua

TL;DR
This paper introduces DACo, a dual-agent framework for scene navigation that separates global planning from local execution, significantly improving long-horizon navigation performance in complex environments.
Contribution
DACo is a novel dual-agent architecture that disentangles global reasoning from local grounding, enhancing stability and robustness in vision-and-language navigation tasks.
Findings
Achieves up to 6.5% improvement over baselines in zero-shot settings.
Effectively generalizes across different backbone models.
Improves long-horizon navigation stability and resilience.
Abstract
Vision-and-Language Scene navigation is a fundamental capability for embodied human-AI collaboration, requiring agents to follow natural language instructions to execute coherent action sequences in complex environments. Existing approaches either rely on multiple agents, incurring high coordination and resource costs, or adopt a single-agent paradigm, which overloads the agent with both global planning and local perception, often leading to degraded reasoning and instruction drift in long-horizon settings. To address these issues, we introduce DACo, a planning-grounding decoupled architecture that disentangles global deliberation from local grounding. Concretely, it employs a Global Commander for high-level strategic planning and a Local Operative for egocentric observing and fine-grained execution. By disentangling global reasoning from local action, DACo alleviates cognitive overload…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms
