An Effective Approach for Multi-label Classification with Missing Labels
Xin Zhang, Rabab Abdelfattah, Yuqi Song, Xiaofeng Wang

TL;DR
This paper introduces a pseudo-label based method and a novel loss function to improve multi-label classification with missing labels, reducing annotation costs and handling label imbalance effectively.
Contribution
It proposes a new approach combining pseudo-labels and a relaxed loss function to address missing labels in multi-label classification, outperforming existing methods.
Findings
Outperforms existing missing-label learning approaches on large-scale datasets.
Handles imbalance between positive and negative labels effectively.
Can sometimes match performance of fully labeled datasets.
Abstract
Compared with multi-class classification, multi-label classification that contains more than one class is more suitable in real life scenarios. Obtaining fully labeled high-quality datasets for multi-label classification problems, however, is extremely expensive, and sometimes even infeasible, with respect to annotation efforts, especially when the label spaces are too large. This motivates the research on partial-label classification, where only a limited number of labels are annotated and the others are missing. To address this problem, we first propose a pseudo-label based approach to reduce the cost of annotation without bringing additional complexity to the existing classification networks. Then we quantitatively study the impact of missing labels on the performance of classifier. Furthermore, by designing a novel loss function, we are able to relax the requirement that each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Machine Learning in Bioinformatics · Machine Learning and Data Classification
