Bayesian Optimized Continual Learning with Attention Mechanism
Ju Xu, Jin Ma, Zhanxing Zhu

TL;DR
This paper introduces BOCL, a continual learning model that dynamically expands neural networks using Bayesian optimization and attention mechanisms to better learn new tasks while avoiding forgetting.
Contribution
It presents a novel continual learning approach combining Bayesian optimization and attention to adapt network capacity and reuse knowledge effectively.
Findings
Outperforms state-of-the-art in preventing catastrophic forgetting
Better fitting of new tasks on MNIST and CIFAR-100 variants
Demonstrates effective dynamic network expansion
Abstract
Though neural networks have achieved much progress in various applications, it is still highly challenging for them to learn from a continuous stream of tasks without forgetting. Continual learning, a new learning paradigm, aims to solve this issue. In this work, we propose a new model for continual learning, called Bayesian Optimized Continual Learning with Attention Mechanism (BOCL) that dynamically expands the network capacity upon the arrival of new tasks by Bayesian optimization and selectively utilizes previous knowledge (e.g. feature maps of previous tasks) via attention mechanism. Our experiments on variants of MNIST and CIFAR-100 demonstrate that our methods outperform the state-of-the-art in preventing catastrophic forgetting and fitting new tasks better.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Anomaly Detection Techniques and Applications
