On the benefits of output sparsity for multi-label classification
Evgenii Chzhen, Christophe Denis, Mohamed Hebiri, Joseph Salmon

TL;DR
This paper introduces a new sparse weighted Hamming loss for multi-label classification, leveraging output sparsity to improve generalization bounds and efficiency in large-scale problems, with empirical validation on real datasets.
Contribution
It proposes a novel loss function that exploits output sparsity, leading to better theoretical bounds and practical efficiency for large-scale multi-label classification.
Findings
Improved generalization bounds linear in output sparsity
Efficient optimization using convex surrogates
Empirical results outperforming non-weighted techniques
Abstract
The multi-label classification framework, where each observation can be associated with a set of labels, has generated a tremendous amount of attention over recent years. The modern multi-label problems are typically large-scale in terms of number of observations, features and labels, and the amount of labels can even be comparable with the amount of observations. In this context, different remedies have been proposed to overcome the curse of dimensionality. In this work, we aim at exploiting the output sparsity by introducing a new loss, called the sparse weighted Hamming loss. This proposed loss can be seen as a weighted version of classical ones, where active and inactive labels are weighted separately. Leveraging the influence of sparsity in the loss function, we provide improved generalization bounds for the empirical risk minimizer, a suitable property for large-scale problems.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Face and Expression Recognition · Machine Learning and Data Classification
