Theory of Optimizing Pseudolinear Performance Measures: Application to F-measure
Shameem A Puthiya Parambath, Nicolas Usunier, Yves Grandvalet

TL;DR
This paper develops a theoretical framework for optimizing pseudo-linear performance measures like F-measure and Jaccard Index, providing algorithms with guarantees for classifier optimization across datasets and classifier types.
Contribution
It introduces a reduction of pseudo-linear measure optimization to cost-sensitive classification with unknown costs and offers a provably effective algorithm for F-measure maximization.
Findings
Theoretical properties of pseudo-linear measures are established.
A general reduction to cost-sensitive classification is proposed.
Numerical experiments demonstrate the importance of cost asymmetry and thresholding.
Abstract
Non-linear performance measures are widely used for the evaluation of learning algorithms. For example, -measure is a commonly used performance measure for classification problems in machine learning and information retrieval community. We study the theoretical properties of a subset of non-linear performance measures called pseudo-linear performance measures which includes -measure, \emph{Jaccard Index}, among many others. We establish that many notions of -measures and \emph{Jaccard Index} are pseudo-linear functions of the per-class false negatives and false positives for binary, multiclass and multilabel classification. Based on this observation, we present a general reduction of such performance measure optimization problem to cost-sensitive classification problem with unknown costs. We then propose an algorithm with provable guarantees to obtain an approximately optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Advanced Optimization Algorithms Research · Control Systems and Identification
