Hierarchically Structured Task-Agnostic Continual Learning
Heinke Hihn, Daniel A. Braun

TL;DR
This paper introduces a hierarchical, task-agnostic continual learning framework using a Bayesian information-theoretic principle, with a novel neural layer that manages task-specific paths to mitigate forgetting across diverse learning settings.
Contribution
It proposes a new optimality principle for task-agnostic continual learning and a Mixture-of-Variational-Experts layer that creates specialized paths to reduce forgetting.
Findings
Competitive performance on continual reinforcement learning tasks
Effective in handling multiple datasets like MNIST, CIFAR-10, CIFAR-100
Demonstrates task-agnostic learning without task-specific knowledge
Abstract
One notable weakness of current machine learning algorithms is the poor ability of models to solve new problems without forgetting previously acquired knowledge. The Continual Learning paradigm has emerged as a protocol to systematically investigate settings where the model sequentially observes samples generated by a series of tasks. In this work, we take a task-agnostic view of continual learning and develop a hierarchical information-theoretic optimality principle that facilitates a trade-off between learning and forgetting. We derive this principle from a Bayesian perspective and show its connections to previous approaches to continual learning. Based on this principle, we propose a neural network layer, called the Mixture-of-Variational-Experts layer, that alleviates forgetting by creating a set of information processing paths through the network which is governed by a gating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and Data Classification
