Convex Calibrated Surrogates for the Multi-Label F-Measure

Mingyuan Zhang; Harish G. Ramaswamy; Shivani Agarwal

arXiv:2009.07801 · stat.ML · September 17, 2020 · 1 citation

TL;DR

This paper develops convex surrogate loss functions for optimizing the multi-label F-measure, enabling more effective training of classifiers that balance precision and recall, with theoretical guarantees and empirical validation.

Contribution

The paper introduces a family of convex surrogates that are calibrated for the multi-label F-measure. The surrogates decompose the learning problem into binary probability estimation problems and come with regret transfer bounds.

Findings

01. The proposed surrogates are calibrated for the multi-label F-measure.

02. The learning problem decomposes into binary probability estimation problems.

03. Empirical results confirm the theoretical guarantees.

Abstract

The F-measure is a widely used performance measure for multi-label classification, where multiple labels can be active in an instance simultaneously (e.g. in image tagging, multiple tags can be active in any image). In particular, the F-measure explicitly balances recall (fraction of active labels predicted to be active) and precision (fraction of labels predicted to be active that are actually so), both of which are important in evaluating the overall performance of a multi-label classifier. As with most discrete prediction problems, however, directly optimizing the F-measure is computationally hard. In this paper, we explore the question of designing convex surrogate losses that are calibrated for the F-measure -- specifically, that have the property that minimizing the surrogate loss yields (in the limit of sufficient data) a Bayes optimal multi-label classifier for the F-measure. We…
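To make the quantities in the abstract concrete, the following minimal Python sketch computes the F-measure between a true and a predicted label vector, then finds a Bayes optimal prediction for a toy label distribution by enumerating all 2^s candidate predictions. The function names, the toy distribution, and the convention that the F-measure equals 1 when both vectors are empty are assumptions made for illustration. The brute-force search is not the paper's method; it only shows what "Bayes optimal for the F-measure" means and why direct search scales poorly with the number of labels.

```python
from itertools import product
from typing import Dict, Sequence, Tuple

Labels = Tuple[int, ...]


def f_measure(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F-measure (F1) between two binary label vectors of equal length."""
    if len(y_true) != len(y_pred):
        raise ValueError("label vectors must have the same length")
    # Labels that are active and also predicted active.
    true_pos = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    n_active = sum(y_true)      # denominator of recall
    n_predicted = sum(y_pred)   # denominator of precision
    if n_active + n_predicted == 0:
        # Assumed convention: an empty prediction for an empty target is perfect.
        return 1.0
    # Harmonic mean of precision and recall simplifies to this ratio.
    return 2.0 * true_pos / (n_active + n_predicted)


def expected_f_measure(dist: Dict[Labels, float], y_pred: Sequence[int]) -> float:
    """Expected F-measure of y_pred under a distribution over label vectors."""
    return sum(prob * f_measure(y, y_pred) for y, prob in dist.items())


def brute_force_bayes_optimal(dist: Dict[Labels, float], num_labels: int) -> Labels:
    """Search all 2**num_labels predictions for the highest expected F-measure."""
    candidates = product((0, 1), repeat=num_labels)
    return max(candidates, key=lambda y_hat: expected_f_measure(dist, y_hat))


if __name__ == "__main__":
    # Hypothetical conditional label distribution for one instance, s = 3 labels.
    toy_dist = {(1, 0, 0): 0.4, (0, 1, 1): 0.3, (1, 1, 0): 0.3}
    best = brute_force_bayes_optimal(toy_dist, num_labels=3)
    print(best, expected_f_measure(toy_dist, best))
```

The enumeration visits every one of the 2^s label subsets, which is only feasible for a handful of labels; a calibrated convex surrogate is meant to reach a Bayes optimal classifier without this search.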

Videos

Convex Calibrated Surrogates for the Multi-Label F-Measure · SlidesLive

Taxonomy

Topics: Machine Learning and Algorithms · Text and Document Classification Technologies · Machine Learning and Data Classification