TL;DR
This paper introduces a probabilistic deep generative retrieval paradigm that leverages neural models like Transformers, demonstrating improved passage retrieval performance and uncertainty estimation for relevance ranking.
Contribution
It formalizes a new deep generative retrieval framework, introduces a novel T-PGN model combining Transformers and Pointer Generator Networks, and shows its superior performance on passage retrieval tasks.
Findings
T-PGN outperforms other generative models in retrieval tasks.
Uncertainty estimation enhances query and collection understanding.
Generative models improve cut-off prediction accuracy.
Abstract
Existing neural ranking models follow the text matching paradigm, where document-to-query relevance is estimated through predicting the matching score. Drawing from the rich literature of classical generative retrieval models, we introduce and formalize the paradigm of deep generative retrieval models defined via the cumulative probabilities of generating query terms. This paradigm offers a grounded probabilistic view on relevance estimation while still enabling the use of modern neural architectures. In contrast to the matching paradigm, the probabilistic nature of generative rankers readily offers a fine-grained measure of uncertainty. We adopt several current neural generative models in our framework and introduce a novel generative ranker (T-PGN), which combines the encoding capacity of Transformers with the Pointer Generator Network model. We conduct an extensive set of evaluation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
