Estimating Multi-label Accuracy using Labelset Distributions

Laurence A. F. Park; Jesse Read

arXiv:2209.04163·cs.LG·September 12, 2022

Estimating Multi-label Accuracy using Labelset Distributions

Laurence A. F. Park, Jesse Read

PDF

Open Access

TL;DR

This paper proposes methods to estimate confidence in multi-label classifiers by deriving expected accuracy from labelset distributions, improving interpretability and decision-making in real-world applications.

Contribution

It introduces and evaluates candidate functions to estimate expected accuracy from probabilistic multi-label distributions, enhancing confidence calibration.

Findings

01

Three candidate functions correlate well with expected accuracy.

02

Combining candidates improves estimation for Jaccard and exact match.

03

Methods are robust across different multi-label accuracy metrics.

Abstract

A multi-label classifier estimates the binary label state (relevant vs irrelevant) for each of a set of concept labels, for any given instance. Probabilistic multi-label classifiers provide a predictive posterior distribution over all possible labelset combinations of such label states (the powerset of labels) from which we can provide the best estimate, simply by selecting the labelset corresponding to the largest expected accuracy, over that distribution. For example, in maximizing exact match accuracy, we provide the mode of the distribution. But how does this relate to the confidence we may have in such an estimate? Confidence is an important element of real-world applications of multi-label classifiers (as in machine learning in general) and is an important ingredient in explainability and interpretability. However, it is not obvious how to provide confidence in the multi-label…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification