Translating Natural Language to Planning Goals with Large-Language Models
Yaqi Xie, Chen Yu, Tongyao Zhu, Jinbin Bai, Ze Gong, Harold Soh

TL;DR
This paper investigates whether large language models can effectively translate natural language goals into structured planning languages, finding they excel at translation but struggle with numerical and spatial reasoning, highlighting their potential and limitations.
Contribution
The study demonstrates that LLMs can serve as effective natural language interfaces for goal translation in planning, with insights into their strengths and weaknesses.
Findings
LLMs are better at translation than planning.
They leverage commonsense to fill in missing goal details.
They struggle with numerical and spatial reasoning tasks.
Abstract
Recent large language models (LLMs) have demonstrated remarkable performance on a variety of natural language processing (NLP) tasks, leading to intense excitement about their applicability across various domains. Unfortunately, recent work has also shown that LLMs are unable to perform accurate reasoning nor solve planning problems, which may limit their usefulness for robotics-related tasks. In this work, our central question is whether LLMs are able to translate goals specified in natural language to a structured planning language. If so, LLM can act as a natural interface between the planner and human users; the translated goal can be handed to domain-independent AI planners that are very effective at planning. Our empirical results on GPT 3.5 variants show that LLMs are much better suited towards translation rather than planning. We find that LLMs are able to leverage commonsense…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · AI-based Problem Solving and Planning
MethodsMulti-Head Attention · Attention Is All You Need · fail · Softmax · Discriminative Fine-Tuning · Linear Layer · Residual Connection · Weight Decay · Byte Pair Encoding · Dropout
