Unlocking the Potentials of Retrieval-Augmented Generation for Diffusion Language Models

Chuanyue Yu; Jiahui Wang; Yuhan Li; Heng Chang; Ge Lan; Qingyun Sun; Jia Li; Jianxin Li; Ziwei Zhang

arXiv:2601.11342·cs.LG·January 19, 2026

Unlocking the Potentials of Retrieval-Augmented Generation for Diffusion Language Models

Chuanyue Yu, Jiahui Wang, Yuhan Li, Heng Chang, Ge Lan, Qingyun Sun, Jia Li, Jianxin Li, Ziwei Zhang

PDF

Open Access

TL;DR

This paper explores integrating Retrieval-Augmented Generation with Diffusion Language Models, identifies semantic drift issues, and proposes a novel method to improve semantic fidelity and generation precision.

Contribution

It systematically evaluates RAG with DLMs, identifies the semantic drift problem, and introduces SPREAD, a query-relevance-guided denoising strategy to enhance semantic alignment.

Findings

01

RAG with DLMs shows promising potential but has limited precision.

02

Semantic Drift (RSD) causes deviation from query semantics during generation.

03

SPREAD effectively reduces RSD and improves answer precision.

Abstract

Diffusion Language Models (DLMs) have recently demonstrated remarkable capabilities in natural language processing tasks. However, the potential of Retrieval-Augmented Generation (RAG), which shows great successes for enhancing large language models (LLMs), has not been well explored, due to the fundamental difference between LLM and DLM decoding. To fill this critical gap, we systematically test the performance of DLMs within the RAG framework. Our findings reveal that DLMs coupled with RAG show promising potentials with stronger dependency on contextual information, but suffer from limited generation precision. We identify a key underlying issue: Response Semantic Drift (RSD), where the generated answer progressively deviates from the query's original semantics, leading to low precision content. We trace this problem to the denoising strategies in DLMs, which fail to maintain semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning