Any-Class Presence Likelihood for Robust Multi-Label Classification with Abundant Negative Data

Dumindu Tissera; Omar Awadallah; Muhammad Umair Danish; Ayan Sadhu; Katarina Grolinger

arXiv:2506.05721·cs.LG·June 9, 2025

Any-Class Presence Likelihood for Robust Multi-Label Classification with Abundant Negative Data

Dumindu Tissera, Omar Awadallah, Muhammad Umair Danish, Ayan Sadhu, Katarina Grolinger

PDF

Open Access

TL;DR

This paper introduces a novel loss function for multi-label classification that effectively handles large amounts of negative data by estimating the likelihood of any class being present, improving performance without added complexity.

Contribution

It proposes a new loss function based on a normalized weighted geometric mean to better utilize negative data in multi-label classification tasks.

Findings

01

Consistent performance improvements across multiple datasets.

02

Up to 6.01 percentage points increase in F1 score.

03

No additional parameters or computational overhead.

Abstract

Multi-label Classification (MLC) assigns an instance to one or more non-exclusive classes. A challenge arises when the dataset contains a large proportion of instances with no assigned class, referred to as negative data, which can overwhelm the learning process and hinder the accurate identification and classification of positive instances. Nevertheless, it is common in MLC applications such as industrial defect detection, agricultural disease identification, and healthcare diagnosis to encounter large amounts of negative data. Assigning a separate negative class to these instances further complicates the learning objective and introduces unnecessary redundancies. To address this challenge, we redesign standard MLC loss functions by deriving a likelihood of any class being present, formulated by a normalized weighted geometric mean of the predicted class probabilities. We introduce a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Machine Learning and Data Classification · Imbalanced Data Classification Techniques