Distilled Replay: Overcoming Forgetting through Synthetic Samples

Andrea Rosasco; Antonio Carta; Andrea Cossu; Vincenzo Lomonaco; Davide; Bacciu

arXiv:2103.15851·cs.LG·June 23, 2021

Distilled Replay: Overcoming Forgetting through Synthetic Samples

Andrea Rosasco, Antonio Carta, Andrea Cossu, Vincenzo Lomonaco, Davide, Bacciu

PDF

Open Access 2 Repos

TL;DR

This paper introduces Distilled Replay, a novel continual learning strategy that uses a small, distilled buffer of highly informative samples to effectively mitigate forgetting, reducing memory requirements.

Contribution

It proposes a distillation-based method to create a minimal yet highly informative replay buffer for continual learning.

Findings

01

Distilled Replay outperforms popular replay strategies on multiple benchmarks.

02

The method uses only 1 pattern per class in the buffer.

03

It effectively mitigates catastrophic forgetting with minimal memory.

Abstract

Replay strategies are Continual Learning techniques which mitigate catastrophic forgetting by keeping a buffer of patterns from previous experiences, which are interleaved with new data during training. The amount of patterns stored in the buffer is a critical parameter which largely influences the final performance and the memory footprint of the approach. This work introduces Distilled Replay, a novel replay strategy for Continual Learning which is able to mitigate forgetting by keeping a very small buffer (1 pattern per class) of highly informative samples. Distilled Replay builds the buffer through a distillation process which compresses a large dataset into a tiny set of informative examples. We show the effectiveness of our Distilled Replay against popular replay-based strategies on four Continual Learning benchmarks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications