Entropy-based Stability-Plasticity for Lifelong Learning

Vladimir Araujo; Julio Hurtado; Alvaro Soto; Marie-Francine Moens

arXiv:2204.09517·cs.LG·April 21, 2022

Entropy-based Stability-Plasticity for Lifelong Learning

Vladimir Araujo, Julio Hurtado, Alvaro Soto, Marie-Francine Moens

PDF

Open Access 1 Repo

TL;DR

This paper introduces an entropy-based method to dynamically balance stability and plasticity in neural networks, improving lifelong learning by reducing interference and enabling layer freezing for faster training.

Contribution

The proposed Entropy-based Stability-Plasticity (ESP) method adaptively adjusts layer plasticity using entropy criteria, addressing the stability-plasticity dilemma in lifelong learning.

Findings

01

Effective in natural language and vision tasks

02

Reduces interference and leverages prior knowledge

03

Enables layer freezing to speed up training

Abstract

The ability to continuously learn remains elusive for deep learning models. Unlike humans, models cannot accumulate knowledge in their weights when learning new tasks, mainly due to an excess of plasticity and the low incentive to reuse weights when training a new task. To address the stability-plasticity dilemma in neural networks, we propose a novel method called Entropy-based Stability-Plasticity (ESP). Our approach can decide dynamically how much each model layer should be modified via a plasticity factor. We incorporate branch layers and an entropy-based criterion into the model to find such factor. Our experiments in the domains of natural language and vision show the effectiveness of our approach in leveraging prior knowledge by reducing interference. Also, in some cases, it is possible to freeze layers during training leading to speed up in training.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vgaraujov/esp-cl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Domain Adaptation and Few-Shot Learning

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings