Prompt Tuning for Few-Shot Continual Learning Named Entity Recognition
Zhe Ren

TL;DR
This paper introduces a prompt tuning approach with memory demonstration templates to improve few-shot continual learning for named entity recognition, addressing knowledge distillation challenges and enhancing performance.
Contribution
The paper proposes an expandable Anchor words-oriented Prompt Tuning paradigm and Memory Demonstration Templates to tackle few-shot CLNER challenges and improve model generalization.
Findings
Achieves competitive performance on FS-CLNER tasks.
Effectively mitigates the Few-Shot Distillation Dilemma.
Enhances in-context learning with memory templates.
Abstract
Knowledge distillation has been successfully applied to Continual Learning Named Entity Recognition (CLNER) tasks, by using a teacher model trained on old-class data to distill old-class entities present in new-class data as a form of regularization, thereby avoiding catastrophic forgetting. However, in Few-Shot CLNER (FS-CLNER) tasks, the scarcity of new-class entities makes it difficult for the trained model to generalize during inference. More critically, the lack of old-class entity information hinders the distillation of old knowledge, causing the model to fall into what we refer to as the Few-Shot Distillation Dilemma. In this work, we address the above challenges through a prompt tuning paradigm and memory demonstration template strategy. Specifically, we designed an expandable Anchor words-oriented Prompt Tuning (APT) paradigm to bridge the gap between pre-training and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
