Learning Early Exit Strategies for Additive Ranking Ensembles

Francesco Busolin; Claudio Lucchese; Franco Maria Nardini; Salvatore; Orlando; Raffaele Perego; Salvatore Trani

arXiv:2105.02568·cs.IR·September 17, 2021

Learning Early Exit Strategies for Additive Ranking Ensembles

Francesco Busolin, Claudio Lucchese, Franco Maria Nardini, Salvatore, Orlando, Raffaele Perego, Salvatore Trani

PDF

Open Access 1 Repo

TL;DR

This paper introduces LEAR, a learned early exit strategy for ensemble ranking models that significantly reduces query response time by predicting when documents can be safely excluded early, without compromising ranking quality.

Contribution

LEAR is a novel learned technique that predicts early exit points in ensemble ranking, improving efficiency without sacrificing accuracy.

Findings

01

Achieves up to 5x speedup in query processing.

02

Maintains ranking quality with negligible loss in NDCG.

03

Effective in production-like settings on public datasets.

Abstract

Modern search engine ranking pipelines are commonly based on large machine-learned ensembles of regression trees. We propose LEAR, a novel - learned - technique aimed to reduce the average number of trees traversed by documents to accumulate the scores, thus reducing the overall query response time. LEAR exploits a classifier that predicts whether a document can early exit the ensemble because it is unlikely to be ranked among the final top-k results. The early exit decision occurs at a sentinel point, i.e., after having evaluated a limited number of trees, and the partial scores are exploited to filter out non-promising documents. We evaluate LEAR by deploying it in a production-like setting, adopting a state-of-the-art algorithm for ensembles traversal. We provide a comprehensive experimental evaluation on two public datasets. The experiments show that LEAR has a significant impact on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hpclab/learning-exit-strategies-ensembles
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation Retrieval and Search Behavior · Text and Document Classification Technologies · Data Quality and Management