CANDI: Hybrid Discrete-Continuous Diffusion Models
Patrick Pynadath, Jiaxin Shi, and Ruqi Zhang

TL;DR
CANDI introduces a hybrid diffusion framework that effectively combines discrete and continuous denoising, overcoming previous limitations and enhancing discrete data generation and guidance capabilities.
Contribution
The paper proposes CANDI, a novel hybrid diffusion model that decouples discrete and continuous corruption, enabling improved discrete data modeling and guidance.
Findings
CANDI avoids the temporal dissonance in diffusion processes.
CANDI outperforms masked diffusion in text generation at low NFE.
CANDI enables classifier guidance using off-the-shelf classifiers.
Abstract
While continuous diffusion has shown remarkable success in continuous domains such as image generation, its direct application to discrete data has underperformed compared to purely discrete formulations. This gap is counterintuitive, given that continuous diffusion learns score functions that enable joint evolution across multiple positions. To understand this gap, we introduce token identifiability as an analytical framework for understanding how Gaussian noise corrupts discrete data through two mechanisms: discrete identity corruption and continuous rank degradation. We reveal that these mechanisms scale differently with vocabulary size, creating a temporal dissonance: at noise levels where discrete corruption preserves enough structure for conditional learning, continuous denoising is trivial; at noise levels where continuous denoising is meaningful, discrete corruption destroys…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
