Making Pre-trained Language Models Better Continual Few-Shot Relation Extractors

Shengkun Ma; Jiale Han; Yi Liang; Bo Cheng

arXiv:2402.15713·cs.CL·February 27, 2024·3 cites

TL;DR

This paper introduces a contrastive prompt learning framework that enhances pre-trained language models for continual few-shot relation extraction, effectively reducing catastrophic forgetting and overfitting in low-resource settings.

Contribution

It proposes a novel contrastive prompt learning approach with memory augmentation, improving continual learning capabilities of language models for relation extraction.

Findings

01 Outperforms state-of-the-art methods significantly

02 Reduces catastrophic forgetting in continual learning

03 Mitigates overfitting in low-resource scenarios

Abstract

Continual Few-shot Relation Extraction (CFRE) is a practical problem that requires a model to continually learn novel relations from only a few labeled training examples while avoiding forgetting old ones. The primary challenges are catastrophic forgetting and overfitting. This paper harnesses prompt learning to explore the implicit capabilities of pre-trained language models to address these two challenges, thereby making language models better continual few-shot relation extractors. Specifically, we propose a Contrastive Prompt Learning framework, which designs prompt representation to acquire more generalized knowledge that can be easily adapted to old and new categories, and uses margin-based contrastive learning to focus more on hard samples, thereby alleviating catastrophic forgetting and overfitting. To further remedy overfitting in low-resource scenarios, we introduce an…
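To make the phrase "margin-based contrastive learning to focus more on hard samples" concrete, here is a minimal sketch of a generic triplet-style margin loss over cosine similarities. It is not the paper's exact objective, which the truncated abstract does not specify; the function names (`cosine`, `margin_contrastive_loss`), the default margin of 0.3, and the toy embeddings are all assumptions made for illustration. The point it shows is that triplets already separated by the margin contribute zero loss, so only hard samples drive the gradient.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def margin_contrastive_loss(embeddings, labels, margin=0.3):
    """Mean hinge loss over all (anchor, positive, negative) triplets.

    Hypothetical helper: a triplet is penalised only when the negative is
    not at least `margin` less similar to the anchor than the positive is.
    """
    total, count = 0.0, 0
    n = len(embeddings)
    for i in range(n):                      # anchor
        for p in range(n):                  # positive: same label, not the anchor
            if p == i or labels[p] != labels[i]:
                continue
            for q in range(n):              # negative: different label
                if labels[q] == labels[i]:
                    continue
                sim_pos = cosine(embeddings[i], embeddings[p])
                sim_neg = cosine(embeddings[i], embeddings[q])
                total += max(0.0, margin - sim_pos + sim_neg)
                count += 1
    return total / count if count else 0.0


# Toy relation embeddings (assumed values, not from the paper).
vectors = [[1.0, 0.0], [1.0, 0.1], [0.0, 1.0], [0.1, 1.0]]

# Labels that match the geometry: every triplet is already separated.
easy_loss = margin_contrastive_loss(vectors, [0, 0, 1, 1])

# Labels that cut across the geometry: many triplets violate the margin.
hard_loss = margin_contrastive_loss(vectors, [0, 1, 0, 1])

print(easy_loss, hard_loss)
```

With the first labelling every triplet clears the margin and the loss is zero; with the second, near-duplicate vectors carry different labels, so the hinge is active and the loss is positive. A practical implementation would operate on batched tensors from the language model rather than Python lists.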

Code & Models

Repositories

mashengkun/cpl
PyTorch · Official

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis

Methods: Focus · Contrastive Learning