Distilled Replay: Overcoming Forgetting through Synthetic Samples
Andrea Rosasco, Antonio Carta, Andrea Cossu, Vincenzo Lomonaco, Davide, Bacciu

TL;DR
This paper introduces Distilled Replay, a novel continual learning strategy that uses a small, distilled buffer of highly informative samples to effectively mitigate forgetting, reducing memory requirements.
Contribution
It proposes a distillation-based method to create a minimal yet highly informative replay buffer for continual learning.
Findings
Distilled Replay outperforms popular replay strategies on multiple benchmarks.
The method uses only 1 pattern per class in the buffer.
It effectively mitigates catastrophic forgetting with minimal memory.
Abstract
Replay strategies are Continual Learning techniques which mitigate catastrophic forgetting by keeping a buffer of patterns from previous experiences, which are interleaved with new data during training. The amount of patterns stored in the buffer is a critical parameter which largely influences the final performance and the memory footprint of the approach. This work introduces Distilled Replay, a novel replay strategy for Continual Learning which is able to mitigate forgetting by keeping a very small buffer (1 pattern per class) of highly informative samples. Distilled Replay builds the buffer through a distillation process which compresses a large dataset into a tiny set of informative examples. We show the effectiveness of our Distilled Replay against popular replay-based strategies on four Continual Learning benchmarks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
