Lifelong Learning with Dynamically Expandable Networks
Jaehong Yoon, Eunho Yang, Jeongtae Lee, Sung Ju Hwang

TL;DR
This paper introduces a dynamically expandable neural network architecture for lifelong learning that adapts its capacity over time, sharing knowledge efficiently among tasks and preventing semantic drift, while outperforming existing methods.
Contribution
The paper presents a novel network architecture, DEN, that dynamically expands and manages capacity for lifelong learning, improving performance and parameter efficiency.
Findings
Outperforms existing lifelong learning methods
Achieves comparable performance to batch models with fewer parameters
Fine-tuned network on all tasks surpasses batch models
Abstract
We propose a novel deep network architecture for lifelong learning which we refer to as Dynamically Expandable Network (DEN), that can dynamically decide its network capacity as it trains on a sequence of tasks, to learn a compact overlapping knowledge sharing structure among tasks. DEN is efficiently trained in an online manner by performing selective retraining, dynamically expands network capacity upon arrival of each task with only the necessary number of units, and effectively prevents semantic drift by splitting/duplicating units and timestamping them. We validate DEN on multiple public datasets under lifelong learning scenarios, on which it not only significantly outperforms existing lifelong learning methods for deep networks, but also achieves the same level of performance as the batch counterparts with substantially fewer number of parameters. Further, the obtained network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · COVID-19 diagnosis using AI
