Discrete Modeling via Boundary Conditional Diffusion Processes
Yuxuan Gu, Xiaocheng Feng, Lei Huang, Yingsheng Wu, Zekun Zhou,, Weihong Zhong, Kun Zhu, Bing Qin

TL;DR
This paper introduces a boundary conditional diffusion framework for discrete data modeling, improving performance in language and image tasks by addressing the gap between discrete and continuous diffusion processes.
Contribution
The authors propose a novel two-step boundary conditional diffusion process that enhances discrete data modeling by estimating and rescaling boundary conditions, outperforming prior methods.
Findings
Outperforms previous state-of-the-art in language translation and summarization.
Achieves competitive results with auto-regressive transformers.
Sets new SOTA for categorical image generation on Cifar-10.
Abstract
We present an novel framework for efficiently and effectively extending the powerful continuous diffusion processes to discrete modeling. Previous approaches have suffered from the discrepancy between discrete data and continuous modeling. Our study reveals that the absence of guidance from discrete boundaries in learning probability contours is one of the main reasons. To address this issue, we propose a two-step forward process that first estimates the boundary as a prior distribution and then rescales the forward trajectory to construct a boundary conditional diffusion model. The reverse process is proportionally adjusted to guarantee that the learned contours yield more precise discrete data. Experimental results indicate that our approach achieves strong performance in both language modeling and discrete image generation tasks. In language modeling, our approach surpasses previous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSimulation Techniques and Applications
MethodsDiffusion
