SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided Distillation
Quang P. M. Pham, Khoi T. N. Nguyen, Nhi H. Doan, Cuong A. Pham, Qinbo Sun, Weimin Qi, Kentaro Inui, Dezhen Song

TL;DR
SmallPlan introduces a framework where small language models are trained using large models as teachers to perform efficient, real-time path planning in robotics, suitable for edge devices, by combining simulation, supervised fine-tuning, and reinforcement learning.
Contribution
The paper presents a novel method for training lightweight language models for path planning using LLM-guided distillation, enabling efficient deployment in robotics.
Findings
SLMs achieve competitive performance with GPT-4o on path planning tasks.
SLMs are resource-efficient and suitable for edge deployment.
The approach reduces hallucination and overfitting in small models.
Abstract
Efficient path planning in robotics, particularly within large-scale, complex environments, remains a significant hurdle. While Large Language Models (LLMs) offer strong reasoning capabilities, their high computational cost and limited adaptability hinder real-time deployment on edge devices. We present SmallPlan - a novel framework leveraging LLMs as teacher models to train lightweight Small Language Models (SLMs) for high-level path planning tasks. In SmallPlan, the SLMs provide optimal action sequences to navigate across scene graphs that compactly represent full-scaled 3D scenes. The SLMs are trained in a simulation-powered, interleaved manner with LLM-guided supervised fine-tuning (SFT) and reinforcement learning (RL). This strategy not only enables SLMs to successfully complete navigation tasks but also makes them aware of important factors like distance travel, providing more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsEmirates Airlines Office in Dubai · Attentive Walk-Aggregating Graph Neural Network
