Using Language Models as Closed-Loop High-Level Planners for Robotics Applications: A Brief Overview and Benchmarks

Hao Wang; Sathwik Karnik; Bea Lim; Somil Bansal

arXiv:2511.07410·cs.RO·April 28, 2026

Using Language Models as Closed-Loop High-Level Planners for Robotics Applications: A Brief Overview and Benchmarks

Hao Wang, Sathwik Karnik, Bea Lim, Somil Bansal

PDF

TL;DR

This paper investigates how to reliably integrate large language models as high-level planners in robotics, focusing on control horizon and warm-starting strategies to enhance performance and robustness.

Contribution

It provides empirical insights and practical recommendations for improving language model-based planning in robotic systems through controlled experiments.

Findings

01

Control horizon and warm-starting significantly affect planning performance.

02

Designed experiments yield actionable insights for robust LLM-based planning.

03

Implementation details and experiments are publicly available.

Abstract

Large Language Models (LLMs) and Vision Language Models (VLMs) have become popular tools for embodied high-level planning. However, their deployment in black-box settings often leads to unpredictable or costly errors. To harness their capabilities more reliably in robotic systems, we empirically investigate practical strategies for integrating language models as closed-loop planners. Concretely, we study how the control horizon and warm-starting impact the performance of language model-based planners. We design and conduct controlled experiments to extract actionable insights, providing recommendations that can help improve the performance and robustness of language model-based embodied planning. The full implementation and experiments are available on the project website

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.