Forward and Backward Feature Selection for Query Performance Prediction
S\'ebastien D\'ejean, Radu Tudor Ionescu, Josiane Mothe, Md Zia Ullah

TL;DR
This paper introduces a step-wise feature selection method for query performance prediction that balances effectiveness, interpretability, and efficiency, by selecting a subset of features for better model understanding and real-world applicability.
Contribution
It proposes a novel forward and backward feature selection framework that improves interpretability and efficiency of QPP models while maintaining competitive predictive performance.
Findings
Limited feature models match complex models in effectiveness.
Selected models are more efficient during inference.
A new feature is consistently useful across datasets.
Abstract
The goal of query performance prediction (QPP) is to automatically estimate the effectiveness of a search result for any given query, without relevance judgements. Post-retrieval features have been shown to be more effective for this task while being more expensive to compute than pre-retrieval features. Combining multiple post-retrieval features is even more effective, but state-of-the-art QPP methods are impossible to interpret because of the black-box nature of the employed machine learning models. However, interpretation is useful for understanding the predictive model and providing more answers about its behavior. Moreover, combining many post-retrieval features is not applicable to real-world cases, since the query running time is of utter importance. In this paper, we investigate a new framework for feature selection in which the trained model explains well the prediction. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFeature Selection
