Robust Planning with Compound LLM Architectures: An LLM-Modulo Approach

Atharva Gundawar; Karthik Valmeekam; Mudit Verma; Subbarao Kambhampati

arXiv:2411.14484·cs.CL·November 25, 2024

Robust Planning with Compound LLM Architectures: An LLM-Modulo Approach

Atharva Gundawar, Karthik Valmeekam, Mudit Verma, Subbarao Kambhampati

PDF

Open Access 1 Repo

TL;DR

This paper introduces the LLM-Modulo framework, a robust compound architecture pairing LLMs with verifiers to ensure correct outputs in planning tasks, outperforming previous prompt-based methods.

Contribution

The paper presents a novel compound LLM architecture that guarantees correctness by integrating verifiers, addressing robustness issues in prior prompt engineering approaches.

Findings

01

Significant performance improvements across four scheduling domains

02

Verifiers effectively prevent fallacious outputs

03

Modifications to the framework impact overall performance

Abstract

Previous work has attempted to boost Large Language Model (LLM) performance on planning and scheduling tasks through a variety of prompt engineering techniques. While these methods can work within the distributions tested, they are neither robust nor predictable. This limitation can be addressed through compound LLM architectures where LLMs work in conjunction with other components to ensure reliability. In this paper, we present a technical evaluation of a compound LLM architecture--the LLM-Modulo framework. In this framework, an LLM is paired with a complete set of sound verifiers that validate its output, re-prompting it if it fails. This approach ensures that the system can never output any fallacious output, and therefore that every output generated is guaranteed correct--something previous techniques have not been able to claim. Our results, evaluated across four scheduling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Atharva-Gundawar/LLM-Modulo-prompts
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Formal Methods in Verification · Logic, Reasoning, and Knowledge

MethodsSparse Evolutionary Training · Balanced Selection