Advances in Embodied Navigation Using Large Language Models: A Survey
Jinzhou Lin, Han Gao, Xuxiang Feng, Rongtao Xu, Changwei Wang, Man, Zhang, Li Guo, Shibiao Xu

TL;DR
This survey reviews how large language models enhance embodied navigation by improving environmental understanding and decision-making, highlighting current models, methodologies, advantages, disadvantages, and future research directions.
Contribution
It provides an exhaustive overview of the integration of LLMs with embodied navigation, summarizing state-of-the-art models, research approaches, and assessing their strengths and weaknesses.
Findings
LLMs significantly improve environmental perception in navigation tasks.
Current models demonstrate robust decision-making capabilities.
Future research will focus on addressing limitations and expanding applications.
Abstract
In recent years, the rapid advancement of Large Language Models (LLMs) such as the Generative Pre-trained Transformer (GPT) has attracted increasing attention due to their potential in a variety of practical applications. The application of LLMs with Embodied Intelligence has emerged as a significant area of focus. Among the myriad applications of LLMs, navigation tasks are particularly noteworthy because they demand a deep understanding of the environment and quick, accurate decision-making. LLMs can augment embodied intelligence systems with sophisticated environmental perception and decision-making support, leveraging their robust language and image-processing capabilities. This article offers an exhaustive summary of the symbiosis between LLMs and embodied intelligence with a focus on navigation. It reviews state-of-the-art models, research methodologies, and assesses the advantages…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Multimodal Machine Learning Applications · Topic Modeling
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Residual Connection · Byte Pair Encoding · Dense Connections · Layer Normalization · Label Smoothing · Position-Wise Feed-Forward Layer
