A Modern Perspective on Query Likelihood with Deep Generative Retrieval   Models

Oleg Lesota; Navid Rekabsaz; Daniel Cohen; Klaus Antonius; Grasserbauer; Carsten Eickhoff; Markus Schedl

arXiv:2106.13618·cs.IR·June 28, 2021

A Modern Perspective on Query Likelihood with Deep Generative Retrieval Models

Oleg Lesota, Navid Rekabsaz, Daniel Cohen, Klaus Antonius, Grasserbauer, Carsten Eickhoff, Markus Schedl

PDF

1 Repo

TL;DR

This paper introduces a probabilistic deep generative retrieval paradigm that leverages neural models like Transformers, demonstrating improved passage retrieval performance and uncertainty estimation for relevance ranking.

Contribution

It formalizes a new deep generative retrieval framework, introduces a novel T-PGN model combining Transformers and Pointer Generator Networks, and shows its superior performance on passage retrieval tasks.

Findings

01

T-PGN outperforms other generative models in retrieval tasks.

02

Uncertainty estimation enhances query and collection understanding.

03

Generative models improve cut-off prediction accuracy.

Abstract

Existing neural ranking models follow the text matching paradigm, where document-to-query relevance is estimated through predicting the matching score. Drawing from the rich literature of classical generative retrieval models, we introduce and formalize the paradigm of deep generative retrieval models defined via the cumulative probabilities of generating query terms. This paradigm offers a grounded probabilistic view on relevance estimation while still enabling the use of modern neural architectures. In contrast to the matching paradigm, the probabilistic nature of generative rankers readily offers a fine-grained measure of uncertainty. We adopt several current neural generative models in our framework and introduce a novel generative ranker (T-PGN), which combines the encoding capacity of Transformers with the Pointer Generator Network model. We conduct an extensive set of evaluation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CPJKU/DeepGenIR
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.