Hierarchically Structured Task-Agnostic Continual Learning

Heinke Hihn; Daniel A. Braun

arXiv:2211.07725·cs.LG·November 16, 2022

Hierarchically Structured Task-Agnostic Continual Learning

Heinke Hihn, Daniel A. Braun

PDF

Open Access 1 Repo

TL;DR

This paper introduces a hierarchical, task-agnostic continual learning framework using a Bayesian information-theoretic principle, with a novel neural layer that manages task-specific paths to mitigate forgetting across diverse learning settings.

Contribution

It proposes a new optimality principle for task-agnostic continual learning and a Mixture-of-Variational-Experts layer that creates specialized paths to reduce forgetting.

Findings

01

Competitive performance on continual reinforcement learning tasks

02

Effective in handling multiple datasets like MNIST, CIFAR-10, CIFAR-100

03

Demonstrates task-agnostic learning without task-specific knowledge

Abstract

One notable weakness of current machine learning algorithms is the poor ability of models to solve new problems without forgetting previously acquired knowledge. The Continual Learning paradigm has emerged as a protocol to systematically investigate settings where the model sequentially observes samples generated by a series of tasks. In this work, we take a task-agnostic view of continual learning and develop a hierarchical information-theoretic optimality principle that facilitates a trade-off between learning and forgetting. We derive this principle from a Bayesian perspective and show its connections to previous approaches to continual learning. Based on this principle, we propose a neural network layer, called the Mixture-of-Variational-Experts layer, that alleviates forgetting by creating a set of information processing paths through the network which is governed by a gating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hhihn/HVCL
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and Data Classification