Prompt Tuning for Few-Shot Continual Learning Named Entity Recognition

Zhe Ren

arXiv:2508.07248·cs.CL·August 12, 2025

Prompt Tuning for Few-Shot Continual Learning Named Entity Recognition

Zhe Ren

PDF

Open Access

TL;DR

This paper introduces a prompt tuning approach with memory demonstration templates to improve few-shot continual learning for named entity recognition, addressing knowledge distillation challenges and enhancing performance.

Contribution

The paper proposes an expandable Anchor words-oriented Prompt Tuning paradigm and Memory Demonstration Templates to tackle few-shot CLNER challenges and improve model generalization.

Findings

01

Achieves competitive performance on FS-CLNER tasks.

02

Effectively mitigates the Few-Shot Distillation Dilemma.

03

Enhances in-context learning with memory templates.

Abstract

Knowledge distillation has been successfully applied to Continual Learning Named Entity Recognition (CLNER) tasks, by using a teacher model trained on old-class data to distill old-class entities present in new-class data as a form of regularization, thereby avoiding catastrophic forgetting. However, in Few-Shot CLNER (FS-CLNER) tasks, the scarcity of new-class entities makes it difficult for the trained model to generalize during inference. More critically, the lack of old-class entity information hinders the distillation of old knowledge, causing the model to fall into what we refer to as the Few-Shot Distillation Dilemma. In this work, we address the above challenges through a prompt tuning paradigm and memory demonstration template strategy. Specifically, we designed an expandable Anchor words-oriented Prompt Tuning (APT) paradigm to bridge the gap between pre-training and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications