From Noise to Order: Learning to Rank via Denoising Diffusion
Sajad Ebrahimi, Bhaskar Mitra, Negar Arabzadeh, Ye Yuan, Haolun Wu, Fattane Zarrinkalam, Ebrahim Bagheri

TL;DR
This paper introduces DiffusionRank, a novel generative learning-to-rank method using denoising diffusion models to improve robustness and performance in information retrieval tasks.
Contribution
It extends diffusion-based generative models to the ranking domain, creating a new approach that models the full joint distribution of features and relevance labels.
Findings
DiffusionRank outperforms traditional discriminative models in IR tasks.
Generative modeling provides more robust ranking solutions.
Significant empirical improvements demonstrated over existing methods.
Abstract
In information retrieval (IR), learning-to-rank (LTR) methods have traditionally limited themselves to discriminative machine learning approaches that model the probability of the document being relevant to the query given some feature representation of the query-document pair. In this work, we propose an alternative denoising diffusion-based deep generative approach to LTR that instead models the full joint distribution over feature vectors and relevance labels. While in the discriminative setting, an over-parameterized ranking model may find different ways to fit the training data, we hypothesize that candidate solutions that can explain the full data distribution under the generative setting produce more robust ranking models. With this motivation, we propose DiffusionRank that extends TabDiff, an existing denoising diffusion-based generative model for tabular datasets, to create…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Image Retrieval and Classification Techniques · Text and Document Classification Technologies
