An Embodied AR Navigation Agent: Integrating BIM with Retrieval-Augmented Generation for Language Guidance
Hsuan-Kung Yang, Tsu-Ching Hsiao, Ryoichiro Oka, Ryuya Nishino, Satoko Tofukuji, Norimasa Kobori

TL;DR
This paper presents an embodied AR navigation system that combines BIM data with retrieval-augmented generation and multi-agent LLMs to provide flexible, natural language-guided navigation assistance with high usability.
Contribution
It introduces a novel integration of BIM with RAG and multi-agent LLMs for flexible, language-based AR navigation, enhancing interpretability and spatial reasoning capabilities.
Findings
Achieved a System Usability Scale (SUS) score of 80.5, indicating excellent usability.
Demonstrated a significant improvement in users' perception of system intelligence.
Validated system effectiveness through a real-world user study.
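For context on the usability figure, SUS scores fall on a 0 to 100 scale. The sketch below shows how one participant's score is conventionally computed, assuming the standard ten-item questionnaire with 1 to 5 Likert responses. The function name `sus_score` is hypothetical, and this is not the authors' analysis code.

```python
def sus_score(responses):
    """Return the SUS score (0-100) for one participant.

    Assumes the standard ten-item SUS form, answered on a 1-5 scale,
    with items listed in questionnaire order.
    """
    if len(responses) != 10:
        raise ValueError("SUS expects exactly 10 responses")
    if any(r < 1 or r > 5 for r in responses):
        raise ValueError("each response must be between 1 and 5")

    total = 0
    for item_number, r in enumerate(responses, start=1):
        if item_number % 2 == 1:
            # Odd-numbered items are positively worded: contribution is r - 1.
            total += r - 1
        else:
            # Even-numbered items are negatively worded: contribution is 5 - r.
            total += 5 - r
    # The summed contributions (0-40) are scaled to 0-100.
    return total * 2.5
```

A study-level score such as the one reported is typically the mean of these per-participant scores.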
Abstract
Delivering intelligent and adaptive navigation assistance in augmented reality (AR) requires more than visual cues, as it demands systems capable of interpreting flexible user intent and reasoning over both spatial and semantic context. Prior AR navigation systems often rely on rigid input schemes or predefined commands, which limit the utility of rich building data and hinder natural interaction. In this work, we propose an embodied AR navigation system that integrates Building Information Modeling (BIM) with a multi-agent retrieval-augmented generation (RAG) framework to support flexible, language-driven goal retrieval and route planning. The system orchestrates three language agents, Triage, Search, and Response, built on large language models (LLMs), which enables robust interpretation of open-ended queries and spatial reasoning using BIM data. Navigation guidance is delivered…
