Learning Term Weights for Ad-hoc Retrieval

B. Piwowarski

arXiv:1606.04223·cs.IR·June 15, 2016

Open Access

TL;DR

This paper introduces a learning-based approach to compute term weights for ad-hoc retrieval, moving beyond traditional heuristics and probabilistic models by leveraging learning-to-rank techniques.

Contribution

It proposes a novel method to learn term weights directly from data, improving relevance scoring in information retrieval systems.

Findings

01. Demonstrates improved retrieval performance over traditional models

02. Introduces a data-driven approach to term weighting

03. Validates effectiveness on benchmark datasets

Abstract

Most Information Retrieval models compute the relevance score of a document for a given query by summing term weights specific to a document or a query. Heuristic approaches, like TF-IDF, or probabilistic models, like BM25, are used to specify how a term weight is computed. In this paper, we propose to leverage learning-to-rank principles to learn how to compute a term weight for a given document based on the term occurrence pattern.
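The additive scoring scheme described in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the paper's method: the function names, the toy corpus, and the `weight_fn` parameter are hypothetical, and the BM25 formula shown is one common variant with conventional defaults (`k1=1.2`, `b=0.75`). The pluggable `weight_fn` marks the place where a learned term-weight function could stand in for the hand-written one.

```python
import math
from collections import Counter


def bm25_term_weight(tf, df, doc_len, avg_doc_len, n_docs, k1=1.2, b=0.75):
    """Weight of one term in one document, using a common BM25 variant."""
    if tf == 0:
        return 0.0
    # Smoothed inverse document frequency; the +1 keeps it positive.
    idf = math.log(1.0 + (n_docs - df + 0.5) / (df + 0.5))
    # Term-frequency saturation with document-length normalization.
    norm = tf + k1 * (1.0 - b + b * doc_len / avg_doc_len)
    return idf * tf * (k1 + 1.0) / norm


def score(query_terms, doc_terms, doc_freqs, avg_doc_len, n_docs,
          weight_fn=bm25_term_weight):
    """Relevance score: the sum of per-term weights over the query terms.

    weight_fn is a hypothetical hook; any function with the same signature
    (for example, a learned one) could replace the BM25 weight.
    """
    counts = Counter(doc_terms)
    return sum(
        weight_fn(counts[t], doc_freqs.get(t, 0), len(doc_terms),
                  avg_doc_len, n_docs)
        for t in query_terms
    )


# Toy corpus (assumed for illustration only).
docs = [
    ["learning", "term", "weights"],
    ["ad", "hoc", "retrieval", "retrieval"],
    ["term", "weights", "for", "retrieval"],
]
n_docs = len(docs)
avg_doc_len = sum(len(d) for d in docs) / n_docs
# Document frequency: number of documents containing each term.
doc_freqs = Counter(t for d in docs for t in set(d))

query = ["retrieval"]
scores = [score(query, d, doc_freqs, avg_doc_len, n_docs) for d in docs]
```

In this toy setup the first document does not contain the query term and scores zero, while the second document, which contains the term twice, outscores the third, which contains it once at the same length.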

Taxonomy

Topics: Topic Modeling · Algorithms and Data Compression · Information Retrieval and Search Behavior