Learning Early Exit Strategies for Additive Ranking Ensembles
Francesco Busolin, Claudio Lucchese, Franco Maria Nardini, Salvatore, Orlando, Raffaele Perego, Salvatore Trani

TL;DR
This paper introduces LEAR, a learned early exit strategy for ensemble ranking models that significantly reduces query response time by predicting when documents can be safely excluded early, without compromising ranking quality.
Contribution
LEAR is a novel learned technique that predicts early exit points in ensemble ranking, improving efficiency without sacrificing accuracy.
Findings
Achieves up to 5x speedup in query processing.
Maintains ranking quality with negligible loss in NDCG.
Effective in production-like settings on public datasets.
Abstract
Modern search engine ranking pipelines are commonly based on large machine-learned ensembles of regression trees. We propose LEAR, a novel - learned - technique aimed to reduce the average number of trees traversed by documents to accumulate the scores, thus reducing the overall query response time. LEAR exploits a classifier that predicts whether a document can early exit the ensemble because it is unlikely to be ranked among the final top-k results. The early exit decision occurs at a sentinel point, i.e., after having evaluated a limited number of trees, and the partial scores are exploited to filter out non-promising documents. We evaluate LEAR by deploying it in a production-like setting, adopting a state-of-the-art algorithm for ensembles traversal. We provide a comprehensive experimental evaluation on two public datasets. The experiments show that LEAR has a significant impact on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Text and Document Classification Technologies · Data Quality and Management
