TrajRAG: Retrieving Geometric-Semantic Experience for Zero-Shot Object Navigation
Yiyao Wang, Sixian Zhang, Keming Zhang, Xinhang Song, Songjie Du, Shuqiang Jiang

TL;DR
TrajRAG is a retrieval-augmented framework that enhances zero-shot object navigation by accumulating and utilizing episodic geometric-semantic experiences through a hierarchical, topological trajectory representation.
Contribution
It introduces a novel topological-polar trajectory representation and a hierarchical chunking structure for lifelong experience accumulation and retrieval in object navigation.
Findings
TrajRAG improves zero-shot ObjectNav performance on MP3D, HM3D-v1, and HM3D-v2 datasets.
Hierarchical chunking organizes similar trajectories into summaries for efficient retrieval.
Retrieving relevant past experiences guides better waypoint selection during navigation.
Abstract
Existing zero-shot Object Goal Navigation (ObjectNav) methods often exploit commonsense knowledge from large language or vision-language models to guide navigation. However, such knowledge arises from internet-scale text rather than embodied 3D experience, and episodic observations collected during navigation are typically discarded, preventing the accumulation of lifelong experience. To this end, we propose Trajectory RAG (TrajRAG), a retrieval-augmented generation framework that enhances large-model reasoning by retrieving geometric-semantic experiences. TrajRAG incrementally accumulates episodic observations from past navigation episodes. To structure these observations, we propose a topological-polar (topo-polar) trajectory representation that compactly encodes spatial layouts and semantic contexts, effectively removing redundancies in raw episodic observations. A hierarchical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
