Consistent Representation Learning for Continual Relation Extraction
Kang Zhao, Hua Xu, Jiangong Yang, Kai Gao

TL;DR
This paper introduces a consistent representation learning approach for continual relation extraction that leverages contrastive learning and knowledge distillation to reduce forgetting and improve performance on imbalanced datasets.
Contribution
It proposes a novel method combining supervised contrastive learning and memory knowledge distillation to maintain stable relation embeddings during continual learning.
Findings
Significantly outperforms state-of-the-art baselines.
Shows strong robustness on imbalanced datasets.
Effectively alleviates catastrophic forgetting.
Abstract
Continual relation extraction (CRE) aims to continuously train a model on data with new relations while avoiding forgetting old ones. Some previous work has proved that storing a few typical samples of old relations and replaying them when learning new relations can effectively avoid forgetting. However, these memory-based methods tend to overfit the memory samples and perform poorly on imbalanced datasets. To solve these challenges, a consistent representation learning method is proposed, which maintains the stability of the relation embedding by adopting contrastive learning and knowledge distillation when replaying memory. Specifically, supervised contrastive learning based on a memory bank is first used to train each new task so that the model can effectively learn the relation representation. Then, contrastive replay is conducted of the samples in memory and makes the model retain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsContrastive Learning · Supervised Contrastive Loss · Knowledge Distillation
