A Prototypical Semantic Decoupling Method via Joint Contrastive Learning   for Few-Shot Name Entity Recognition

Guanting Dong; Zechen Wang; Liwen Wang; Daichi Guo; Dayuan; Fu; Yuxiang Wu; Chen Zeng; Xuefeng Li; Tingfeng Hui; Keqing He; and Xinyue Cui; Qixiang Gao; Weiran Xu

arXiv:2302.13610·cs.CL·April 13, 2023

A Prototypical Semantic Decoupling Method via Joint Contrastive Learning for Few-Shot Name Entity Recognition

Guanting Dong, Zechen Wang, Liwen Wang, Daichi Guo, Dayuan, Fu, Yuxiang Wu, Chen Zeng, Xuefeng Li, Tingfeng Hui, Keqing He, and Xinyue Cui, Qixiang Gao, Weiran Xu

PDF

Open Access

TL;DR

This paper introduces PSDC, a novel few-shot NER method that decouples semantic information via contrastive learning, leading to improved performance and better generalization over previous state-of-the-art models.

Contribution

The paper proposes a new decoupling approach with joint contrastive learning for few-shot NER, addressing prototype confusion and enhancing semantic representation.

Findings

01

Outperforms previous SOTA methods on two benchmarks

02

Effectively decouples class-specific and contextual semantics

03

Demonstrates strong generalization and robustness

Abstract

Few-shot named entity recognition (NER) aims at identifying named entities based on only few labeled instances. Most existing prototype-based sequence labeling models tend to memorize entity mentions which would be easily confused by close prototypes. In this paper, we proposed a Prototypical Semantic Decoupling method via joint Contrastive learning (PSDC) for few-shot NER. Specifically, we decouple class-specific prototypes and contextual semantic prototypes by two masking strategies to lead the model to focus on two different semantic information for inference. Besides, we further introduce joint contrastive learning objectives to better integrate two kinds of decoupling information and prevent semantic collapse. Experimental results on two few-shot NER benchmarks demonstrate that PSDC consistently outperforms the previous SOTA methods in terms of overall performance. Extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Interpreting and Communication in Healthcare

MethodsContrastive Learning