On Aggregation in Ensembles of Multilabel Classifiers
Vu-Linh Nguyen, Eyke H\"ullermeier, Michael Rapp, Eneldo Loza, Menc\'ia, Johannes F\"urnkranz

TL;DR
This paper introduces a formal framework for ensemble multilabel classification, distinguishing two main approaches—predict-then-combine and combine-then-predict—and demonstrates their advantages over standard voting methods through experiments.
Contribution
The paper formalizes ensemble multilabel classification with two approaches, enabling tailored optimization for different loss functions, and empirically shows their superiority over traditional voting techniques.
Findings
CTP performs well for decomposable loss functions.
PCT is better suited for non-decomposable losses.
Both approaches outperform standard voting methods.
Abstract
While a variety of ensemble methods for multilabel classification have been proposed in the literature, the question of how to aggregate the predictions of the individual members of the ensemble has received little attention so far. In this paper, we introduce a formal framework of ensemble multilabel classification, in which we distinguish two principal approaches: "predict then combine" (PTC), where the ensemble members first make loss minimizing predictions which are subsequently combined, and "combine then predict" (CTP), which first aggregates information such as marginal label probabilities from the individual ensemble members, and then derives a prediction from this aggregation. While both approaches generalize voting techniques commonly used for multilabel ensembles, they allow to explicitly take the target performance measure into account. Therefore, concrete instantiations of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
