Gradient-based Label Binning in Multi-label Classification
Michael Rapp, Eneldo Loza Menc\'ia, Johannes F\"urnkranz, Eyke, H\"ullermeier

TL;DR
This paper introduces a gradient-based label binning method in multi-label classification to reduce computational costs of second-order boosting approaches while maintaining predictive accuracy.
Contribution
It proposes a novel label grouping technique that approximates second-order derivatives, significantly speeding up training in multi-label boosting models.
Findings
Training speed improves without loss of accuracy.
Label binning effectively reduces computational complexity.
Method maintains high predictive performance.
Abstract
In multi-label classification, where a single example may be associated with several class labels at the same time, the ability to model dependencies between labels is considered crucial to effectively optimize non-decomposable evaluation measures, such as the Subset 0/1 loss. The gradient boosting framework provides a well-studied foundation for learning models that are specifically tailored to such a loss function and recent research attests the ability to achieve high predictive accuracy in the multi-label setting. The utilization of second-order derivatives, as used by many recent boosting approaches, helps to guide the minimization of non-decomposable losses, due to the information about pairs of labels it incorporates into the optimization process. On the downside, this comes with high computational costs, even if the number of labels is small. In this work, we address the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
