A plug-in approach to maximising precision at the top and recall at the   top

Dirk Tasche

arXiv:1804.03077·stat.ML·April 10, 2018·5 cites

A plug-in approach to maximising precision at the top and recall at the top

Dirk Tasche

PDF

Open Access

TL;DR

This paper demonstrates that in information retrieval and binary classification, optimal precision at the top and recall at the top are achieved by thresholding the posterior probability, based on a generalized cost-sensitive error minimization.

Contribution

It introduces a plug-in method for maximizing precision and recall at the top by linking these metrics to posterior probability thresholding, extending previous theoretical results.

Findings

01

Optimal top precision and recall are achieved by thresholding posterior probabilities.

02

The approach generalizes earlier results on cost-sensitive error minimization.

03

Provides a practical method for improving ranking performance in retrieval tasks.

Abstract

For information retrieval and binary classification, we show that precision at the top (or precision at k) and recall at the top (or recall at k) are maximised by thresholding the posterior probability of the positive class. This finding is a consequence of a result on constrained minimisation of the cost-sensitive expected classification error which generalises an earlier related result from the literature.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Text and Document Classification Technologies · Imbalanced Data Classification Techniques