TL;DR
This paper extends gradient boosting to multi-label classification, enabling optimization of various loss functions, including non-decomposable ones, and demonstrates its effectiveness on synthetic and benchmark datasets.
Contribution
It introduces a novel gradient boosting framework for multi-label classification that handles both decomposable and non-decomposable loss functions.
Findings
Effective optimization of Hamming and subset 0/1 loss functions.
Successful application on synthetic and benchmark datasets.
Identifies limitations and potential of the proposed method.
Abstract
In multi-label classification, where the evaluation of predictions is less straightforward than in single-label classification, various meaningful, though different, loss functions have been proposed. Ideally, the learning algorithm should be customizable towards a specific choice of the performance measure. Modern implementations of boosting, most prominently gradient boosted decision trees, appear to be appealing from this point of view. However, they are mostly limited to single-label classification, and hence not amenable to multi-label losses unless these are label-wise decomposable. In this work, we develop a generalization of the gradient boosting framework to multi-output problems and propose an algorithm for learning multi-label classification rules that is able to minimize decomposable as well as non-decomposable loss functions. Using the well-known Hamming loss and subset 0/1…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
