TL;DR
This paper introduces KG-HTC, a method that integrates knowledge graphs with large language models to improve zero-shot hierarchical text classification, especially in large, long-tail label spaces.
Contribution
The paper presents a novel approach combining knowledge graphs and retrieval-augmented generation to enhance LLMs for zero-shot HTC across multiple datasets.
Findings
KG-HTC significantly outperforms baselines in zero-shot settings.
Method improves understanding of label semantics at various hierarchy levels.
Effective in large label spaces with long-tail distributions.
Abstract
Hierarchical Text Classification (HTC) involves assigning documents to labels organized within a taxonomy. Most previous research on HTC has focused on supervised methods. However, in real-world scenarios, employing supervised HTC can be challenging due to a lack of annotated data. Moreover, HTC often faces issues with large label spaces and long-tail distributions. In this work, we present Knowledge Graphs for zero-shot Hierarchical Text Classification (KG-HTC), which aims to address these challenges of HTC in applications by integrating knowledge graphs with Large Language Models (LLMs) to provide structured semantic context during classification. Our method retrieves relevant subgraphs from knowledge graphs related to the input text using a Retrieval-Augmented Generation (RAG) approach. Our KG-HTC can enhance LLMs to understand label semantics at various hierarchy levels. We evaluate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Text and Document Classification Technologies · Topic Modeling
MethodsRoIAlign · Region Proposal Network · Convolution · 1x1 Convolution · Feature Pyramid Network · Hybrid Task Cascade
