Serial Contrastive Knowledge Distillation for Continual Few-shot   Relation Extraction

Xinyi Wang; Zitao Wang; Wei Hu

arXiv:2305.06616·cs.CL·May 12, 2023·2 cites

Serial Contrastive Knowledge Distillation for Continual Few-shot Relation Extraction

Xinyi Wang, Zitao Wang, Wei Hu

PDF

Open Access 1 Repo

TL;DR

This paper introduces SCKD, a novel model for continual few-shot relation extraction that addresses catastrophic forgetting and data sparsity through serial knowledge distillation and contrastive learning, demonstrating superior performance on benchmarks.

Contribution

The paper proposes SCKD, a new approach combining serial knowledge distillation and contrastive learning to improve continual few-shot relation extraction.

Findings

01

SCKD outperforms state-of-the-art models on benchmark datasets.

02

SCKD effectively mitigates catastrophic forgetting.

03

SCKD enhances knowledge transfer and memory utilization.

Abstract

Continual few-shot relation extraction (RE) aims to continuously train a model for new relations with few labeled training data, of which the major challenges are the catastrophic forgetting of old relations and the overfitting caused by data sparsity. In this paper, we propose a new model, namely SCKD, to accomplish the continual few-shot RE task. Specifically, we design serial knowledge distillation to preserve the prior knowledge from previous models and conduct contrastive learning with pseudo samples to keep the representations of samples in different relations sufficiently distinguishable. Our experiments on two benchmark datasets validate the effectiveness of SCKD for continual few-shot RE and its superiority in knowledge transfer and memory utilization over state-of-the-art models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nju-websoft/sckd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Interpreting and Communication in Healthcare

MethodsKnowledge Distillation · Contrastive Learning