On Coherence-based Predictors for Dense Query Performance Prediction
Maria Vlachou, Craig Macdonald

TL;DR
This paper introduces coherence-based predictors using neural embeddings to improve query performance prediction for dense retrieval models, revealing the influence of query types on predictor stability.
Contribution
It develops novel unsupervised coherence-based predictors tailored for dense retrieval models and analyzes how query types affect predictor performance and stability.
Findings
Achieved up to 92% accuracy improvement for TCT-ColBERT and 188% for ANCE over sparse variants.
Identified query types significantly influence query performance and predictor stability.
Dense retrieval models exhibit unique sensitivity to query type variations.
Abstract
Query Performance Prediction (QPP) estimates the effectiveness of a search engine's results in response to a query without relevance judgments. Traditionally, post-retrieval predictors have focused upon either the distribution of the retrieval scores, or the coherence of the top-ranked documents using traditional bag-of-words index representations. More recently, BERT-based models using dense embedded document representations have been used to create new predictors, but mostly applied to predict the performance of rankings created by BM25. Instead, we aim to predict the effectiveness of rankings created by single-representation dense retrieval models (ANCE & TCT-ColBERT). Therefore, we propose a number of variants of existing unsupervised coherence-based predictors that employ neural embedding representations. In our experiments on the TREC Deep Learning Track datasets, we demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Text and Document Classification Technologies · Topic Modeling
