DiffuGR: Generative Document Retrieval with Diffusion Language Models
Xinpeng Zhao, Zhaochun Ren, Yukun Zhao, Zhenyang Li, Mengqi Zhang, Jun Feng, Ran Chen, Ying Zhou, Zhumin Chen, Shuaiqiang Wang, Dawei Yin, Xin Xin

TL;DR
DiffuGR introduces a diffusion-based generative document retrieval method that improves accuracy and efficiency control over traditional auto-regressive models, enabling parallel DocID generation and dynamic trade-off management.
Contribution
It proposes a novel diffusion language model approach for document retrieval, addressing limitations of auto-regressive methods with parallel generation and controllable quality-latency trade-offs.
Findings
Outperforms strong auto-regressive retrievers on benchmarks.
Enables explicit control over retrieval quality and latency.
Demonstrates effective parallel DocID generation with diffusion refinement.
Abstract
Generative retrieval (GR) reframes document retrieval as an end-to-end task of generating sequential document identifiers (DocIDs). Existing GR methods predominantly rely on left-to-right auto-regressive decoding, which suffers from two fundamental limitations: (i) a \emph{mismatch between DocID generation and natural language generation}, whereby an incorrect DocID token generated at an early step can lead to entirely erroneous retrieval; and (ii) an \emph{inability to dynamically balance the trade-off between retrieval efficiency and accuracy}, which is crucial for practical applications. To tackle these challenges, we propose generative document retrieval with diffusion language models, termed \emph{DiffuGR}. DiffuGR formulates DocID generation as a discrete diffusion process. During training, DocIDs are corrupted through a stochastic masking process, and a diffusion language model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Topic Modeling · Handwritten Text Recognition Techniques
