Can LLM be a Good Path Planner based on Prompt Engineering? Mitigating the Hallucination for Path Planning
Hourui Deng, Hongjie Zhang, Jie Ou, Chaosheng Feng

TL;DR
This paper introduces S2RCQL, a novel approach combining spatial-to-relational transformation, Q-learning, and curriculum learning to improve LLMs' path planning by mitigating hallucinations and enhancing reasoning in maze environments.
Contribution
The study presents a new model, S2RCQL, that effectively reduces hallucinations in LLMs for path planning through innovative transformations, Q-learning integration, and curriculum learning techniques.
Findings
Achieved 23%-40% improvement in success rates.
Enhanced LLM reasoning capabilities in maze navigation.
Reduced spatial and context hallucinations.
Abstract
Spatial reasoning in Large Language Models (LLMs) is the foundation for embodied intelligence. However, even in simple maze environments, LLMs still encounter challenges in long-term path-planning, primarily influenced by their spatial hallucination and context inconsistency hallucination by long-term reasoning. To address this challenge, this study proposes an innovative model, Spatial-to-Relational Transformation and Curriculum Q-Learning (S2RCQL). To address the spatial hallucination of LLMs, we propose the Spatial-to-Relational approach, which transforms spatial prompts into entity relations and paths representing entity relation chains. This approach fully taps the potential of LLMs in terms of sequential thinking. As a result, we design a path-planning algorithm based on Q-learning to mitigate the context inconsistency hallucination, which enhances the reasoning ability of LLMs.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Logic, programming, and type systems
MethodsQ-Learning
