Hierarchical Text Classification with Reinforced Label Assignment

Yuning Mao; Jingjing Tian; Jiawei Han; Xiang Ren

arXiv:1908.10419·cs.IR·June 19, 2020

Hierarchical Text Classification with Reinforced Label Assignment

Yuning Mao, Jingjing Tian, Jiawei Han, Xiang Ren

PDF

1 Repo

TL;DR

This paper introduces HiLAP, a reinforcement learning-based framework for hierarchical text classification that consistently models label dependencies during training and inference, significantly improving accuracy over existing methods.

Contribution

It formulates HTC as a Markov decision process and proposes a deep reinforcement learning approach for label assignment, addressing previous inconsistencies and modeling label dependencies effectively.

Findings

01

Achieves 33.4% average improvement in Macro-F1 over flat classifiers.

02

Outperforms state-of-the-art HTC methods on five datasets.

03

Flexible framework compatible with various neural encoders.

Abstract

While existing hierarchical text classification (HTC) methods attempt to capture label hierarchies for model training, they either make local decisions regarding each label or completely ignore the hierarchy information during inference. To solve the mismatch between training and inference as well as modeling label dependencies in a more principled way, we formulate HTC as a Markov decision process and propose to learn a Label Assignment Policy via deep reinforcement learning to determine where to place an object and when to stop the assignment process. The proposed method, HiLAP, explores the hierarchy during both training and inference time in a consistent manner and makes inter-dependent decisions. As a general framework, HiLAP can incorporate different neural encoders as base models for end-to-end training. Experiments on five public datasets and four base models show that HiLAP…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

morningmoni/HiLAP
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution