Guaranteed Generation from Large Language Models

Minbeom Kim; Thibaut Thonet; Jos Rozen; Hwaran Lee; Kyomin Jung; Marc Dymetman

arXiv:2410.06716·cs.CL·July 3, 2025

Guaranteed Generation from Large Language Models

Minbeom Kim, Thibaut Thonet, Jos Rozen, Hwaran Lee, Kyomin Jung, Marc Dymetman

PDF

Open Access 1 Repo 1 Video 3 Reviews

TL;DR

This paper introduces GUARD, a method combining proposal distributions with rejection sampling to guarantee constraint satisfaction in large language model outputs while maintaining distributional similarity and improving efficiency.

Contribution

The paper proposes GUARD, a novel approach that guarantees constraint satisfaction in LLMs by integrating inference-time rejection sampling with theoretical analysis of distributional control.

Findings

01

GUARD achieves perfect constraint satisfaction in experiments.

02

GUARD maintains distributional closeness to the original model.

03

GUARD significantly improves inference efficiency.

Abstract

As large language models (LLMs) are increasingly used across various applications, there is a growing need to control text generation to satisfy specific constraints or requirements. This raises a crucial question: Is it possible to guarantee strict constraint satisfaction in generated outputs while preserving the distribution of the original model as much as possible? We first define the ideal distribution - the one closest to the original model, which also always satisfies the expressed constraint - as the ultimate goal of guaranteed generation. We then state a fundamental limitation, namely that it is impossible to reach that goal through autoregressive training alone. This motivates the necessity of combining training-time and inference-time methods to enforce such guarantees. Based on this insight, we propose GUARD, a simple yet effective approach that combines an autoregressive…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 3Confidence 5

Strengths

The papers motivate an important challenge for the existing LLMs and provide proofs that it is theoretically intractable to sample from the LLM distributions conditioned on logical constraints.

Weaknesses

The authors could have done a more extensive literature survey and demonstrate how their work relates to/differs from following approaches for constrained generation: FUDGE [1]: trains auxiliary classifiers on existing classification datasets and uses the classifiers to guide generation to satisfy the constraint. NeuroLogic A*esque Decoding [2]: performs lookahead decoding with heuristic functions that estimate how likely the constraint will be satisfied. NADO [3]: trains auxiliary classifie

Reviewer 02Rating 8Confidence 3

Strengths

Guaranteed generation is an important requirement in real applications of LLMs. Though relatively easy to come up with, the algorithm proposed in this paper do provide an effective solution for this problem. Meanwhile, Theorem 1 gives a good reminder for trials attempting to solve such problem without considering sampling strategies. Overall, I would be glad to see this paper being accepted, as an important progress under this specific problem.

Weaknesses

While this paper has its merit in terms of contribution, it did not give me a good impression at first glance. The following suggestions may be helpful to the authors: 1. Algorithm 1 is too short. It is frustrating to see a 3-line algorithm in a paper. Switching its position with that of Algorithm 2 would be much better. 2. The content of Theorem 2 is too simple to be a theorem. It's OK to put it as an equation. 3. For Theorem 1 in main text, I suggest to replace it with the complete version in

Reviewer 03Rating 5Confidence 4

Strengths

- **Clear Motivation and Intuition:** The authors provide clear intuitions about the need for distribution-preserving constraint satisfaction in language models and the challenges this entails. They thoroughly motivate their approach for achieving strict control over generations without substantially deviating from the original model’s output distribution. - **Theoretical Rigor:** The proposed method is supported by theoretical principles, providing a mathematically grounded mechanism that guara

Weaknesses

**Theory:** Although well-motivated, the novelty of the approach is somewhat unclear, as it largely combines established alignment techniques with a naïve rejection sampling method to achieve guaranteed generation from language models: - First, the authors propose a specific prompt (CAP) and/or a fine-tuned model (SFT, DPG), denoted as $a’$, to approximate the gold distribution $g$. These methods are commonly used to align model outputs with desired behaviors. However, there is no guarantee that

Code & Models

Repositories

naver/guard
pytorch

Videos

Guaranteed Generation from Large Language Models· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings