TL;DR
This paper introduces an order-free learning framework for multi-label classification that removes the need for predefined label sequences, reducing exposure bias and improving generalization to unseen label combinations.
Contribution
It proposes a novel order-free approach for MLC that alleviates exposure bias and enhances the ability to generate unseen label combinations.
Findings
Outperforms baseline models on three benchmark datasets.
Higher probability of generating unseen label combinations.
Demonstrates better generalization capability.
Abstract
Multi-label classification (MLC) assigns multiple labels to each sample. Prior studies show that MLC can be transformed to a sequence prediction problem with a recurrent neural network (RNN) decoder to model the label dependency. However, training a RNN decoder requires a predefined order of labels, which is not directly available in the MLC specification. Besides, RNN thus trained tends to overfit the label combinations in the training set and have difficulty generating unseen label sequences. In this paper, we propose a new framework for MLC which does not rely on a predefined label order and thus alleviates exposure bias. The experimental results on three multi-label classification benchmark datasets show that our method outperforms competitive baselines by a large margin. We also find the proposed approach has a higher probability of generating label combinations not seen during…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
