Continual Learning for Recurrent Neural Networks: an Empirical   Evaluation

Andrea Cossu; Antonio Carta; Vincenzo Lomonaco; Davide Bacciu

arXiv:2103.07492·cs.LG·August 3, 2021

Continual Learning for Recurrent Neural Networks: an Empirical Evaluation

Andrea Cossu, Antonio Carta, Vincenzo Lomonaco, Davide Bacciu

PDF

1 Datasets

TL;DR

This paper reviews and categorizes continual learning approaches for recurrent neural networks, introduces new benchmarks based on real-world data, and empirically evaluates strategies to mitigate forgetting in sequential data processing.

Contribution

It provides a comprehensive literature review, proposes two new benchmarks for CL with sequential data, and conducts an extensive empirical evaluation of CL strategies for RNNs.

Findings

01

Sequence length significantly impacts CL performance.

02

Clear scenario specification is crucial for effective CL.

03

Different strategies vary in their ability to mitigate forgetting.

Abstract

Learning continuously during all model lifetime is fundamental to deploy machine learning solutions robust to drifts in the data distribution. Advances in Continual Learning (CL) with recurrent neural networks could pave the way to a large number of applications where incoming data is non stationary, like natural language processing and robotics. However, the existing body of work on the topic is still fragmented, with approaches which are application-specific and whose assessment is based on heterogeneous learning protocols and datasets. In this paper, we organize the literature on CL for sequential data processing by providing a categorization of the contributions and a review of the benchmarks. We propose two new benchmarks for CL with sequential data based on existing datasets, whose characteristics resemble real-world applications. We also provide a broad empirical evaluation of CL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Kylan12/Synthetic-AI-ML-Dataset
dataset· 42 dl
42 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.