Tackling Multilabel Imbalance through Label Decoupling and Data Resampling Hybridization
Francisco Charte, Antonio J. Rivera, Mar\'ia J. del Jesus and, Francisco Herrera

TL;DR
This paper introduces a hybrid resampling approach combining label decoupling with existing techniques to improve multilabel imbalance handling, supported by empirical analysis and practical guidelines.
Contribution
It proposes a novel hybrid method integrating REMEDIAL with popular resampling algorithms for multilabel imbalance, with comprehensive empirical evaluation.
Findings
Hybrid methods improve class balance in multilabel datasets.
Label decoupling influences resampling effectiveness.
Guidelines for combining decoupling and resampling techniques are provided.
Abstract
The learning from imbalanced data is a deeply studied problem in standard classification and, in recent times, also in multilabel classification. A handful of multilabel resampling methods have been proposed in late years, aiming to balance the labels distribution. However these methods have to face a new obstacle, specific for multilabel data, as is the joint appearance of minority and majority labels in the same data patterns. We proposed recently a new algorithm designed to decouple imbalanced labels concurring in the same instance, called REMEDIAL (\textit{REsampling MultilabEl datasets by Decoupling highly ImbAlanced Labels}). The goal of this work is to propose a procedure to hybridize this method with some of the best resampling algorithms available in the literature, including random oversampling, heuristic undersampling and synthetic sample generation techniques. These hybrid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
