Prototypical Extreme Multi-label Classification with a Dynamic Margin Loss
Kunal Dahiya, Diego Ortego, David Jim\'enez

TL;DR
PRIME introduces a novel prototypical contrastive learning approach for extreme multi-label classification, balancing efficiency and high performance by leveraging label prototypes and a shallow transformer encoder.
Contribution
The paper presents PRIME, a new XMC method that uses a data-to-prototype prediction framework with a shallow encoder and adaptive triplet loss, outperforming existing approaches.
Findings
Achieves state-of-the-art results on multiple benchmarks.
Balances efficiency and performance in large label spaces.
Uses a novel adaptive triplet loss for better training.
Abstract
Extreme Multi-label Classification (XMC) methods predict relevant labels for a given query in an extremely large label space. Recent works in XMC address this problem using deep encoders that project text descriptions to an embedding space suitable for recovering the closest labels. However, learning deep models can be computationally expensive in large output spaces, resulting in a trade-off between high performing brute-force approaches and efficient solutions. In this paper, we propose PRIME, a XMC method that employs a novel prototypical contrastive learning technique to reconcile efficiency and performance surpassing brute-force approaches. We frame XMC as a data-to-prototype prediction task where label prototypes aggregate information from related queries. More precisely, we use a shallow transformer encoder that we coin as Label Prototype Network, which enriches label…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsText and Document Classification Technologies
MethodsContrastive Learning · Triplet Loss
