WAVE++: Capturing Within-Task Variance for Continual Relation Extraction with Adaptive Prompting
Bao-Ngoc Dao, Minh Le, Quang Nguyen, Luyen Ngo Dinh, Nam Le, Linh Ngo Van

TL;DR
WAVE++ introduces a prompt-based continual relation extraction method that captures within-task and cross-task variations without storing past data, using task-specific prompt pools, label descriptions, and a generative model to improve accuracy and privacy.
Contribution
It proposes a novel prompt-based CRE approach with task-specific prompt pools, label descriptions, and a generative model to better handle task variations and eliminate data storage.
Findings
Outperforms state-of-the-art methods in CRE tasks.
Effectively captures within-task and cross-task variations.
Reduces memory usage and privacy concerns.
Abstract
Memory-based approaches have shown strong performance in Continual Relation Extraction (CRE). However, storing examples from previous tasks increases memory usage and raises privacy concerns. Recently, prompt-based methods have emerged as a promising alternative, as they do not rely on storing past samples. Despite this progress, current prompt-based techniques face several core challenges in CRE, particularly in accurately identifying task identities and mitigating catastrophic forgetting. Existing prompt selection strategies often suffer from inaccuracies, lack robust mechanisms to prevent forgetting in shared parameters, and struggle to handle both cross-task and within-task variations. In this paper, we propose WAVE++, a novel approach inspired by the connection between prefix-tuning and mixture of experts. Specifically, we introduce task-specific prompt pools that enhance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Human Pose and Action Recognition · Speech Recognition and Synthesis
