Explaining the Performance of Multi-label Classification Methods with Data Set Properties
Jasmin Bogatinovski, Ljup\v{c}o Todorovski, Sa\v{s}o D\v{z}eroski,, Dragi Kocev

TL;DR
This study uses meta-learning to analyze 40 multi-label classification datasets, revealing that label space properties are key to understanding dataset behavior and that hyperparameter tuning can improve performance but with variable cost-effectiveness.
Contribution
It provides a comprehensive meta-learning analysis of multi-label classification datasets, highlighting the importance of label space features and hyperparameter optimization impacts.
Findings
Label space features are most influential in dataset characterization.
Relationships among labels are more predictive than individual label distributions.
Hyperparameter tuning can enhance performance but may not always be cost-effective.
Abstract
Meta learning generalizes the empirical experience with different learning tasks and holds promise for providing important empirical insight into the behaviour of machine learning algorithms. In this paper, we present a comprehensive meta-learning study of data sets and methods for multi-label classification (MLC). MLC is a practically relevant machine learning task where each example is labelled with multiple labels simultaneously. Here, we analyze 40 MLC data sets by using 50 meta features describing different properties of the data. The main findings of this study are as follows. First, the most prominent meta features that describe the space of MLC data sets are the ones assessing different aspects of the label space. Second, the meta models show that the most important meta features describe the label space, and, the meta features describing the relationships among the labels tend…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Text and Document Classification Technologies
