Towards Imbalanced Large Scale Multi-label Classification with Partially Annotated Labels
XIn Zhang, Yuqi Song, Fei Zuo, Xiaofeng Wang

TL;DR
This paper proposes a novel approach for large-scale multi-label classification with partial labels, addressing label imbalance through pseudo-labeling, a new loss function, and dynamic training, achieving superior results on benchmark datasets.
Contribution
It introduces a pseudo-labeling technique, a new imbalance-aware loss function, and a dynamic training scheme for effective learning with partial labels in large label spaces.
Findings
Outperforms state-of-the-art methods on multiple datasets.
In some cases, surpasses full-label training performance.
Effectively mitigates label imbalance and missing labels issues.
Abstract
Multi-label classification is a widely encountered problem in daily life, where an instance can be associated with multiple classes. In theory, this is a supervised learning method that requires a large amount of labeling. However, annotating data is time-consuming and may be infeasible for huge labeling spaces. In addition, label imbalance can limit the performance of multi-label classifiers, especially when some labels are missing. Therefore, it is meaningful to study how to train neural networks using partial labels. In this work, we address the issue of label imbalance and investigate how to train classifiers using partial labels in large labeling spaces. First, we introduce the pseudo-labeling technique, which allows commonly adopted networks to be applied in partially labeled settings without the need for additional complex structures. Then, we propose a novel loss function that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · COVID-19 diagnosis using AI · Machine Learning and Data Classification
