Conditional [MASK] Discrete Diffusion Language Model

Hyukhun Koh; Minha Jhang; Dohyung Kim; Sangmook Lee; Kyomin Jung

arXiv:2411.06438·cs.CL·February 25, 2025

Conditional [MASK] Discrete Diffusion Language Model

Hyukhun Koh, Minha Jhang, Dohyung Kim, Sangmook Lee, Kyomin Jung

PDF

Open Access

TL;DR

This paper introduces Diffusion-EAGS, a novel diffusion-based language model that combines conditional masked language models with entropy-adaptive techniques to improve diversity and controllability in non-autoregressive text generation.

Contribution

It proposes a new framework integrating conditional masked language models into diffusion models via a theoretical conditional Markov Random Field approach, with entropy-adaptive sampling methods.

Findings

01

Outperforms baseline models in quality-diversity tradeoff

02

Achieves superior diversity in generated text

03

Demonstrates effectiveness in non-autoregressive generation

Abstract

Although auto-regressive models excel in natural language processing, they often struggle to generate diverse text and provide limited controllability. Non-auto-regressive methods could be an alternative but often produce degenerate outputs and exhibit shortcomings in conditional generation. To address these challenges, we propose Diffusion-EAGS, a novel framework that integrates conditional masked language models into diffusion language models through the theoretical lens of a conditional Markov Random Field. In doing so, we propose entropy-adaptive Gibbs sampling and entropy-based noise scheduling to counterbalance each model's shortcomings. Experimental results show that Diffusion-EAGS outperforms baselines and achieves the best quality-diversity tradeoff, demonstrating its effectiveness in non-autoregressive text generation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Speech Recognition and Synthesis · Bayesian Methods and Mixture Models

MethodsDiffusion