Multi-Head Encoding for Extreme Label Classification
Daojun Liang, Haixia Zhang, Dongfeng Yuan, Minggao Zhang

TL;DR
This paper introduces Multi-Head Encoding (MHE), a novel approach for extreme label classification that reduces computational complexity by decomposing labels into local components, achieving state-of-the-art results.
Contribution
The paper proposes MHE, a multi-head classifier mechanism that decomposes labels to significantly reduce computational load in extreme label classification tasks.
Findings
Achieves state-of-the-art performance on XLC benchmarks.
Reduces training and inference complexity significantly.
Provides theoretical guarantees comparable to vanilla classifiers.
Abstract
The number of categories of instances in the real world is normally huge, and each instance may contain multiple labels. To distinguish these massive labels utilizing machine learning, eXtreme Label Classification (XLC) has been established. However, as the number of categories increases, the number of parameters and nonlinear operations in the classifier also rises. This results in a Classifier Computational Overload Problem (CCOP). To address this, we propose a Multi-Head Encoding (MHE) mechanism, which replaces the vanilla classifier with a multi-head classifier. During the training process, MHE decomposes extreme labels into the product of multiple short local labels, with each head trained on these local labels. During testing, the predicted labels can be directly calculated from the local predictions of each head. This reduces the computational load geometrically. Then, according…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Music and Audio Processing · Modeling, Simulation, and Optimization
