Improving Low-Resource Sequence Labeling with Knowledge Fusion and Contextual Label Explanations

Peichao Lai; Jiaxin Gan; Feiyang Ye; Yilei Wang; Bin Cui

arXiv:2501.19093 · cs.CL · October 7, 2025

TL;DR

The paper pairs an LLM-based knowledge enhancement workflow with a span-based model, KnowFREE, to improve low-resource, domain-specific Chinese sequence labeling. It reports state-of-the-art results, which it attributes to mitigating semantic distribution biases and to efficient nested entity extraction.

Contribution

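The paper contributes a workflow that uses explanation prompts to generate contextual interpretations of target entities, together with the span-based KnowFREE model. Combined, they improve contextual understanding and entity extraction in low-resource, domain-specific Chinese sequence labeling tasks.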

Findings

01 Achieves state-of-the-art performance on Chinese datasets.

02 Effectively mitigates semantic distribution biases.

03 Enables efficient nested entity extraction without external knowledge.

Abstract

Sequence labeling remains a significant challenge in low-resource, domain-specific scenarios, particularly for character-dense languages like Chinese. Existing methods primarily focus on enhancing model comprehension and improving data diversity to boost performance. However, these approaches still struggle with inadequate model applicability and semantic distribution biases in domain-specific contexts. To overcome these limitations, we propose a novel framework that combines an LLM-based knowledge enhancement workflow with a span-based Knowledge Fusion for Rich and Efficient Extraction (KnowFREE) model. Our workflow employs explanation prompts to generate precise contextual interpretations of target entities, effectively mitigating semantic biases and enriching the model's contextual understanding. The KnowFREE model further integrates extension label features, enabling efficient…
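To illustrate why a span-based formulation suits nested entities, here is a minimal sketch of generic span enumeration and decoding. It is not the KnowFREE implementation: the function names `enumerate_spans` and `decode_entities`, the `max_len` cap, and the hand-written labels standing in for a classifier's output are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# A span is (start, end) in character offsets, with end exclusive.
Span = Tuple[int, int]


def enumerate_spans(text: str, max_len: int) -> List[Span]:
    """List every character span of length 1..max_len (hypothetical helper)."""
    return [
        (start, end)
        for start in range(len(text))
        for end in range(start + 1, min(start + max_len, len(text)) + 1)
    ]


def decode_entities(text: str, span_labels: Dict[Span, str]) -> List[Tuple[str, str]]:
    """Turn labelled spans into (surface, label) pairs, keeping nested ones."""
    return [(text[s:e], label) for (s, e), label in sorted(span_labels.items())]


# Illustrative input: a location nested inside an organisation name.
text = "北京大学"
spans = enumerate_spans(text, max_len=4)

# Hand-written labels standing in for a trained span classifier's predictions.
span_labels = {(0, 2): "LOC", (0, 4): "ORG"}
entities = decode_entities(text, span_labels)
```

Because each candidate span is labelled independently, the overlapping spans (0, 2) and (0, 4) can both carry a label, which a one-tag-per-character scheme cannot express directly.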

Taxonomy

Topics: Rough Sets and Fuzzy Logic · Data Management and Algorithms

Methods: Focus