TL;DR
This paper introduces Latent Replay, a novel method for continual learning that stores intermediate layer activations instead of raw data, reducing computational load and enabling near real-time learning on edge devices.
Contribution
The paper proposes Latent Replay, a new rehearsal technique that improves continual learning efficiency by storing activations at intermediate layers, combined with layer-wise learning rate adjustments.
Findings
Achieves state-of-the-art results on CORe50 NICv2 and OpenLORIS benchmarks.
Enables near real-time continual learning on a smartphone device.
Reduces storage and computation compared to traditional rehearsal methods.
Abstract
Training deep neural networks at the edge on light computational devices, embedded systems and robotic platforms is nowadays very challenging. Continual learning techniques, where complex models are incrementally trained on small batches of new data, can make the learning problem tractable even for CPU-only embedded devices enabling remarkable levels of adaptiveness and autonomy. However, a number of practical problems need to be solved: catastrophic forgetting before anything else. In this paper we introduce an original technique named "Latent Replay" where, instead of storing a portion of past data in the input space, we store activations volumes at some intermediate layer. This can significantly reduce the computation and storage required by native rehearsal. To keep the representation stable and the stored activations valid we propose to slow-down learning at all the layers below…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
