Estimating Multi-label Accuracy using Labelset Distributions
Laurence A. F. Park, Jesse Read

TL;DR
This paper proposes methods to estimate confidence in multi-label classifiers by deriving expected accuracy from labelset distributions, improving interpretability and decision-making in real-world applications.
Contribution
It introduces and evaluates candidate functions to estimate expected accuracy from probabilistic multi-label distributions, enhancing confidence calibration.
Findings
Three candidate functions correlate well with expected accuracy.
Combining candidates improves estimation for Jaccard and exact match.
Methods are robust across different multi-label accuracy metrics.
Abstract
A multi-label classifier estimates the binary label state (relevant vs irrelevant) for each of a set of concept labels, for any given instance. Probabilistic multi-label classifiers provide a predictive posterior distribution over all possible labelset combinations of such label states (the powerset of labels) from which we can provide the best estimate, simply by selecting the labelset corresponding to the largest expected accuracy, over that distribution. For example, in maximizing exact match accuracy, we provide the mode of the distribution. But how does this relate to the confidence we may have in such an estimate? Confidence is an important element of real-world applications of multi-label classifiers (as in machine learning in general) and is an important ingredient in explainability and interpretability. However, it is not obvious how to provide confidence in the multi-label…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
