Adaptive Guidance for Retrieval-Augmented Masked Diffusion Models
Jaemin Kim, Jong Chul Ye

TL;DR
This paper introduces ARAM, a training-free adaptive guidance method for Masked Diffusion Models in retrieval-augmented generation, which dynamically adjusts guidance based on context reliability to enhance question-answering accuracy.
Contribution
The work presents a novel adaptive guidance framework for diffusion models that calibrates guidance during denoising based on context reliability, addressing retrieval conflicts.
Findings
ARAM improves QA performance over baseline RAG methods.
Dynamic guidance calibration enhances handling of noisy retrieved context.
Extensive experiments validate the effectiveness of ARAM across benchmarks.
Abstract
Retrieval-Augmented Generation (RAG) improves factual grounding by incorporating external knowledge into language model generation. However, when retrieved context is noisy, unreliable, or inconsistent with the model's parametric knowledge, it introduces retrieval-prior conflicts that can degrade generation quality. While this problem has been studied in autoregressive language models, it remains largely unexplored in diffusion-based language models, where the iterative denoising process introduces unique challenges for integrating retrieved context. In this work, we propose Adaptive Retrieval-Augmented Masked Diffusion (ARAM), a training-free adaptive guidance framework for Masked Diffusion Models (MDMs) in RAG settings. ARAM dynamically calibrates the guidance scale during denoising according to the Signal-to-Noise Ratio (SNR) of the distributional shift induced by retrieved context.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
