Continual Learning with Dependency Preserving Hypernetworks

Dupati Srikar Chandra; Sakshi Varshney; P.K. Srijith; Sunil Gupta

arXiv:2209.07712·cs.LG·September 19, 2022

Continual Learning with Dependency Preserving Hypernetworks

Dupati Srikar Chandra, Sakshi Varshney, P.K. Srijith, Sunil Gupta

PDF

Open Access 1 Video

TL;DR

This paper introduces a dependency preserving hypernetwork using RNNs for continual learning, which effectively generates task-specific weights while maintaining parameter efficiency, thereby reducing catastrophic forgetting.

Contribution

It proposes a novel RNN-based hypernetwork that preserves dependencies across layers and introduces regularisation and growth techniques to enhance continual learning performance.

Findings

01

Outperforms baseline methods in multiple image classification CL tasks

02

Effectively maintains dependencies across network layers

03

Reduces catastrophic forgetting in continual learning scenarios

Abstract

Humans learn continually throughout their lifespan by accumulating diverse knowledge and fine-tuning it for future tasks. When presented with a similar goal, neural networks suffer from catastrophic forgetting if data distributions across sequential tasks are not stationary over the course of learning. An effective approach to address such continual learning (CL) problems is to use hypernetworks which generate task dependent weights for a target network. However, the continual learning performance of existing hypernetwork based approaches are affected by the assumption of independence of the weights across the layers in order to maintain parameter efficiency. To address this limitation, we propose a novel approach that uses a dependency preserving hypernetwork to generate weights for the target network while also maintaining the parameter efficiency. We propose to use recurrent neural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Continual Learning with Dependency Preserving Hypernetworks· youtube

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning

MethodsHyperNetwork