Enabling Efficiency-Precision Trade-offs for Label Trees in Extreme Classification
Tavor Z. Baharav, Daniel L. Jiang, Kedarnath Kolluri, Sujay Sanghavi,, Inderjit S. Dhillon

TL;DR
This paper introduces an information theory-inspired algorithm that allows flexible trade-offs between efficiency and accuracy in label tree construction for extreme classification, improving latency without sacrificing accuracy.
Contribution
The authors propose a novel algorithm enabling interpolation between statistical performance and latency in label tree design for XMC, addressing a key challenge in the field.
Findings
Reduces expected latency by up to 28% on Wiki-500K dataset
Improves latency by up to 20% on e-commerce datasets
Maintains accuracy while optimizing latency
Abstract
Extreme multi-label classification (XMC) aims to learn a model that can tag data points with a subset of relevant labels from an extremely large label set. Real world e-commerce applications like personalized recommendations and product advertising can be formulated as XMC problems, where the objective is to predict for a user a small subset of items from a catalog of several million products. For such applications, a common approach is to organize these labels into a tree, enabling training and inference times that are logarithmic in the number of labels. While training a model once a label tree is available is well studied, designing the structure of the tree is a difficult task that is not yet well understood, and can dramatically impact both model latency and statistical performance. Existing approaches to tree construction fall at an extreme point, either optimizing exclusively for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
