DiSMEC - Distributed Sparse Machines for Extreme Multi-label Classification
Rohit Babbar, Bernhard Shoelkopf

TL;DR
DiSMEC is a scalable distributed framework for extreme multi-label classification that avoids low-rank assumptions, efficiently learns classifiers for hundreds of thousands of labels, and improves accuracy over state-of-the-art methods.
Contribution
DiSMEC introduces a distributed, one-vs-rest linear classifier framework with explicit capacity control, handling large label spaces without low-rank assumptions.
Findings
Can learn classifiers for datasets with up to 670,000 labels within hours.
Achieves 10% better accuracy than SLEEC on some datasets.
Achieves 15% better accuracy than FastXML on some datasets.
Abstract
Extreme multi-label classification refers to supervised multi-label learning involving hundreds of thousands or even millions of labels. Datasets in extreme classification exhibit fit to power-law distribution, i.e. a large fraction of labels have very few positive instances in the data distribution. Most state-of-the-art approaches for extreme multi-label classification attempt to capture correlation among labels by embedding the label matrix to a low-dimensional linear sub-space. However, in the presence of power-law distributed extremely large and diverse label spaces, structural assumptions such as low rank can be easily violated. In this work, we present DiSMEC, which is a large-scale distributed framework for learning one-versus-rest linear classifiers coupled with explicit capacity control to control model size. Unlike most state-of-the-art methods, DiSMEC does not make any low…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Spam and Phishing Detection · Web Data Mining and Analysis
