DiffuRank: Effective Document Reranking with Diffusion Language Models
Qi Liu, Kun Ai, Jiaxin Mao, Yanzhao Zhang, Mingxin Li, Dingkun Long, Pengjun Xie, Fengbin Zhu, Ji-Rong Wen

TL;DR
This paper introduces DiffuRank, a novel document reranking framework using diffusion language models (dLLMs) that offers more flexible, efficient, and controllable reranking compared to traditional autoregressive LLMs.
Contribution
The paper proposes DiffuRank, leveraging dLLMs for reranking, and develops three strategies with training methods, demonstrating competitive performance on benchmarks.
Findings
dLLMs achieve comparable or better reranking performance than autoregressive LLMs.
DiffuRank enables parallel decoding, improving efficiency.
The approach is effective in both zero-shot and fine-tuned settings.
Abstract
Recent advances in large language models (LLMs) have inspired new paradigms for document reranking. While this paradigm better exploits the reasoning and contextual understanding capabilities of LLMs, most existing LLM-based rerankers rely on autoregressive generation, which limits their efficiency and flexibility. In particular, token-by-token decoding incurs high latency, while the fixed left-to-right generation order causes early prediction errors to propagate and is difficult to revise. To address these limitations, we explore the use of diffusion language models (dLLMs) for document reranking and propose DiffuRank, a reranking framework built upon dLLMs. Unlike autoregressive models, dLLMs support more flexible decoding and generation processes that are not constrained to a left-to-right order, and enable parallel decoding, which may lead to improved efficiency and controllability.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Generative Adversarial Networks and Image Synthesis
