Hierarchical Verbalizer for Few-Shot Hierarchical Text Classification
Ke Ji, Yixin Lian, Jingsheng Gao, Baoyuan Wang

TL;DR
This paper introduces HierVerb, a hierarchical verbalizer framework that enhances few-shot hierarchical text classification by integrating label hierarchy knowledge into prompt-based learning with pre-trained language models.
Contribution
The paper proposes HierVerb, a novel multi-verbalizer approach that incorporates hierarchical structure and contrastive learning for improved few-shot HTC performance.
Findings
HierVerb outperforms graph encoder-based hierarchy injection methods.
Prompt-based HTC with HierVerb significantly improves accuracy in few-shot settings.
The approach effectively bridges the gap between PLMs and hierarchical classification tasks.
Abstract
Due to the complex label hierarchy and intensive labeling cost in practice, the hierarchical text classification (HTC) suffers a poor performance especially when low-resource or few-shot settings are considered. Recently, there is a growing trend of applying prompts on pre-trained language models (PLMs), which has exhibited effectiveness in the few-shot flat text classification tasks. However, limited work has studied the paradigm of prompt-based learning in the HTC problem when the training data is extremely scarce. In this work, we define a path-based few-shot setting and establish a strict path-based evaluation metric to further explore few-shot HTC tasks. To address the issue, we propose the hierarchical verbalizer ("HierVerb"), a multi-verbalizer framework treating HTC as a single- or multi-label classification problem at multiple layers and learning vectors as verbalizers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
MethodsRegion Proposal Network · 1x1 Convolution · Feature Pyramid Network · RoIAlign · Convolution · Hybrid Task Cascade
