Leveraging Pretrained Language Models as Energy Functions for Glauber Dynamics Text Diffusion

Tarun Kathuria; Sachin Kumar

arXiv:2605.04291·cs.LG·May 7, 2026

Leveraging Pretrained Language Models as Energy Functions for Glauber Dynamics Text Diffusion

Tarun Kathuria, Sachin Kumar

PDF

TL;DR

This paper introduces a novel discrete diffusion language model using Glauber dynamics and pretrained language models as energy functions, achieving improved text generation quality and strong performance on reasoning tasks.

Contribution

It proposes a new diffusion framework leveraging pretrained language models as energy functions, enhancing text quality and reasoning capabilities over prior diffusion methods.

Findings

01

Outperforms previous diffusion-based language models.

02

Achieves competitive results with autoregressive models like GPT-2.

03

Excels in zero-shot reasoning and puzzle-solving tasks.

Abstract

We present a discrete diffusion-based language model using Glauber dynamics from statistical physics. Our main insight is that instead of trying to train a discrete state space diffusion model using Glauber dynamics with a uniform transition kernel as the forward process, one can set up an ``energy function'' based on pretrained causal/masked language models. When viewed as the stationary distribution, this energy function allows us to significantly improve the quality of the generated text. Incorporating UL2 as the pretrained model into our diffusion pipeline, we outperform prior diffusion based LMs and perform competitively with autoregressive models of comparable model sizes. Furthermore, our models are competitive with or outperform prior diffusion models and GPT-2 style auto-regressive models on zero-shot common sense reasoning tasks as well as planning and search tasks like Sudoku…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.