RAGNav: A Retrieval-Augmented Topological Reasoning Framework for Multi-Goal Visual-Language Navigation

Ling Luo; Qiangian Bai

arXiv:2603.03745 · cs.AI · March 5, 2026

TL;DR

RAGNav is a novel framework that combines topological and semantic reasoning with retrieval mechanisms to improve multi-goal vision-language navigation, addressing spatial hallucinations and planning drift.

Contribution

The paper introduces a Dual-Basis Memory system that pairs a low-level topological map with a high-level semantic forest, enabling better spatial reasoning and target screening in multi-goal VLN tasks.

Findings

1. Achieves state-of-the-art performance on complex multi-goal navigation tasks.
2. Enhances inter-target reachability reasoning and sequential planning efficiency.
3. Reduces semantic noise and spatial hallucinations in navigation.

Abstract

Vision-Language Navigation (VLN) is evolving from single-point pathfinding toward the more challenging Multi-Goal VLN. This task requires agents to accurately identify multiple entities while collaboratively reasoning over their spatial-physical constraints and sequential execution order. However, generic Retrieval-Augmented Generation (RAG) paradigms often suffer from spatial hallucinations and planning drift when handling multi-object associations due to the lack of explicit spatial modeling. To address these challenges, we propose RAGNav, a framework that bridges the gap between semantic reasoning and physical structure. The core of RAGNav is a Dual-Basis Memory system, which integrates a low-level topological map for maintaining physical connectivity with a high-level semantic forest for hierarchical environment abstraction. Building on this representation, the framework introduces…
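
As a rough illustration of the two-layer memory the abstract describes, the sketch below pairs an undirected place graph (the topological layer) with a small tree of labeled entities (the semantic layer). It is not the authors' implementation: every name here (`SemanticNode`, `DualBasisMemory`, `connect`, `find`, `hops`, `build_demo`) and the toy floor plan are assumptions made for illustration, and the abstract does not specify how RAGNav actually builds or queries its memory.

```python
from __future__ import annotations

from collections import deque
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class SemanticNode:
    """Hypothetical node of the semantic forest (e.g. floor -> room -> object)."""
    label: str
    place_id: Optional[int] = None  # topological node this entity is anchored to
    children: list[SemanticNode] = field(default_factory=list)


class DualBasisMemory:
    """Assumed pairing of a place graph with a forest of labeled entities."""

    def __init__(self) -> None:
        self.adjacency: dict[int, set[int]] = {}  # topological layer
        self.roots: list[SemanticNode] = []       # semantic layer

    def connect(self, a: int, b: int) -> None:
        """Record that places a and b are physically connected (undirected)."""
        self.adjacency.setdefault(a, set()).add(b)
        self.adjacency.setdefault(b, set()).add(a)

    def find(self, label: str) -> Optional[SemanticNode]:
        """Depth-first search of the forest for an entity with this label."""
        stack = list(self.roots)
        while stack:
            node = stack.pop()
            if node.label == label:
                return node
            stack.extend(node.children)
        return None

    def hops(self, start: int, goal: int) -> Optional[int]:
        """Breadth-first hop count between two places, or None if unreachable."""
        if start == goal:
            return 0
        seen = {start}
        queue = deque([(start, 0)])
        while queue:
            current, dist = queue.popleft()
            for nxt in self.adjacency.get(current, ()):
                if nxt == goal:
                    return dist + 1
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append((nxt, dist + 1))
        return None


def build_demo() -> DualBasisMemory:
    """Toy environment: hall (0) links kitchen (1) and bedroom (2)."""
    mem = DualBasisMemory()
    mem.connect(0, 1)
    mem.connect(0, 2)
    mug = SemanticNode("mug", place_id=1)
    lamp = SemanticNode("lamp", place_id=2)
    kitchen = SemanticNode("kitchen", place_id=1, children=[mug])
    bedroom = SemanticNode("bedroom", place_id=2, children=[lamp])
    mem.roots.append(SemanticNode("floor", children=[kitchen, bedroom]))
    return mem
```

In this toy setup, a multi-goal query such as "the mug, then the lamp" would resolve each label through `find` and then check inter-target reachability with `hops` on the anchored place IDs. Keeping connectivity separate from semantics is what lets the reachability check run on explicit graph structure rather than on retrieved text alone.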

Taxonomy

Topics: Multimodal Machine Learning Applications · Robotic Path Planning Algorithms · Constraint Satisfaction and Optimization