Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration
Yun-Yen Chuang, Hung-Min Hsu, Kevin Lin, Chen-Sheng Gu, Ling Zhen Li,, Ray-I Chang, Hung-yi Lee

TL;DR
Meta-DiffuB introduces a novel context-aware noise scheduling framework for sequence-to-sequence text diffusion, significantly improving generation quality and flexibility over existing models by employing meta-exploration for dynamic noise control.
Contribution
The paper presents Meta-DiffuB, a new scheduler-exploiter diffusion paradigm that uses meta-exploration to optimize contextualized noise scheduling in Seq2Seq tasks, outperforming prior models.
Findings
Achieves state-of-the-art results on four Seq2Seq benchmarks.
Demonstrates the effectiveness of contextualized noise scheduling.
The scheduler can be used as a plug-and-play component to improve existing models.
Abstract
The diffusion model, a new generative modeling paradigm, has achieved significant success in generating images, audio, video, and text. It has been adapted for sequence-to-sequence text generation (Seq2Seq) through DiffuSeq, termed S2S Diffusion. Existing S2S-Diffusion models predominantly rely on fixed or hand-crafted rules to schedule noise during the diffusion and denoising processes. However, these models are limited by non-contextualized noise, which fails to fully consider the characteristics of Seq2Seq tasks. In this paper, we propose the Meta-DiffuB framework - a novel scheduler-exploiter S2S-Diffusion paradigm designed to overcome the limitations of existing S2S-Diffusion models. We employ Meta-Exploration to train an additional scheduler model dedicated to scheduling contextualized noise for each sentence. Our exploiter model, an S2S-Diffusion model, leverages the noise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence · Diffusion
