Task Relation Distillation and Prototypical Pseudo Label for Incremental   Named Entity Recognition

Duzhen Zhang; Hongliu Li; Wei Cong; Rongtao Xu; Jiahua Dong; Xiuyi; Chen

arXiv:2308.08793·cs.CL·August 21, 2023

Task Relation Distillation and Prototypical Pseudo Label for Incremental Named Entity Recognition

Duzhen Zhang, Hongliu Li, Wei Cong, Rongtao Xu, Jiahua Dong, Xiuyi, Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces RDP, a novel method for incremental NER that reduces catastrophic forgetting and background shift by using task relation distillation and prototypical pseudo labels, significantly improving performance.

Contribution

The paper proposes a new approach combining task relation distillation and prototypical pseudo labeling to enhance incremental NER, addressing key challenges of forgetting and background shift.

Findings

01

Achieves 6.08% higher Micro F1 score on average

02

Achieves 7.71% higher Macro F1 score on average

03

Outperforms previous state-of-the-art methods on multiple datasets

Abstract

Incremental Named Entity Recognition (INER) involves the sequential learning of new entity types without accessing the training data of previously learned types. However, INER faces the challenge of catastrophic forgetting specific for incremental learning, further aggravated by background shift (i.e., old and future entity types are labeled as the non-entity type in the current task). To address these challenges, we propose a method called task Relation Distillation and Prototypical pseudo label (RDP) for INER. Specifically, to tackle catastrophic forgetting, we introduce a task relation distillation scheme that serves two purposes: 1) ensuring inter-task semantic consistency across different incremental learning tasks by minimizing inter-task relation distillation loss, and 2) enhancing the model's prediction confidence by minimizing intra-task self-entropy loss. Simultaneously, to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bladedancer957/iner_rdp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning