Continual Learning for Recurrent Neural Networks: an Empirical Evaluation
Andrea Cossu, Antonio Carta, Vincenzo Lomonaco, Davide Bacciu

TL;DR
This paper reviews and categorizes continual learning approaches for recurrent neural networks, introduces new benchmarks based on real-world data, and empirically evaluates strategies to mitigate forgetting in sequential data processing.
Contribution
It provides a comprehensive literature review, proposes two new benchmarks for CL with sequential data, and conducts an extensive empirical evaluation of CL strategies for RNNs.
Findings
Sequence length significantly impacts CL performance.
Clear scenario specification is crucial for effective CL.
Different strategies vary in their ability to mitigate forgetting.
Abstract
Learning continuously during all model lifetime is fundamental to deploy machine learning solutions robust to drifts in the data distribution. Advances in Continual Learning (CL) with recurrent neural networks could pave the way to a large number of applications where incoming data is non stationary, like natural language processing and robotics. However, the existing body of work on the topic is still fragmented, with approaches which are application-specific and whose assessment is based on heterogeneous learning protocols and datasets. In this paper, we organize the literature on CL for sequential data processing by providing a categorization of the contributions and a review of the benchmarks. We propose two new benchmarks for CL with sequential data based on existing datasets, whose characteristics resemble real-world applications. We also provide a broad empirical evaluation of CL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
