AttentionXML: Label Tree-based Attention-Aware Deep Model for High-Performance Extreme Multi-Label Text Classification
Ronghui You, Zihan Zhang, Ziye Wang, Suyang Dai, Hiroshi Mamitsuka and, Shanfeng Zhu

TL;DR
AttentionXML introduces a label tree-based deep learning model with attention mechanisms for high-performance extreme multi-label text classification, effectively capturing relevant subtext and scaling to millions of labels.
Contribution
The paper presents a novel AttentionXML model combining multi-label attention and a shallow probabilistic label tree to improve scalability and accuracy in XMTC tasks.
Findings
Outperforms eight state-of-the-art methods on six benchmark datasets.
Achieves superior performance on tail labels.
Successfully scales to datasets with around 3 million labels.
Abstract
Extreme multi-label text classification (XMTC) is an important problem in the era of big data, for tagging a given text with the most relevant multiple labels from an extremely large-scale label set. XMTC can be found in many applications, such as item categorization, web page tagging, and news annotation. Traditionally most methods used bag-of-words (BOW) as inputs, ignoring word context as well as deep semantic information. Recent attempts to overcome the problems of BOW by deep learning still suffer from 1) failing to capture the important subtext for each label and 2) lack of scalability against the huge number of labels. We propose a new label tree-based deep learning model for XMTC, called AttentionXML, with two unique features: 1) a multi-label attention mechanism with raw text as input, which allows to capture the most relevant part of text to each label; and 2) a shallow and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Sentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques
