Convex Calibrated Surrogates for the Multi-Label F-Measure
Mingyuan Zhang, Harish G. Ramaswamy, Shivani Agarwal

TL;DR
This paper develops convex surrogate loss functions for optimizing the multi-label F-measure, enabling more effective training of classifiers that balance precision and recall, with theoretical guarantees and empirical validation.
Contribution
It introduces a family of convex calibrated surrogates for the multi-label F-measure, decomposing the problem into binary probability estimations with regret transfer bounds.
Findings
Surrogates are calibrated for the F-measure.
Decomposition into binary probability estimation problems.
Empirical results confirm theoretical guarantees.
Abstract
The F-measure is a widely used performance measure for multi-label classification, where multiple labels can be active in an instance simultaneously (e.g. in image tagging, multiple tags can be active in any image). In particular, the F-measure explicitly balances recall (fraction of active labels predicted to be active) and precision (fraction of labels predicted to be active that are actually so), both of which are important in evaluating the overall performance of a multi-label classifier. As with most discrete prediction problems, however, directly optimizing the F-measure is computationally hard. In this paper, we explore the question of designing convex surrogate losses that are calibrated for the F-measure -- specifically, that have the property that minimizing the surrogate loss yields (in the limit of sufficient data) a Bayes optimal multi-label classifier for the F-measure. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Text and Document Classification Technologies · Machine Learning and Data Classification
