HAT-CL: A Hard-Attention-to-the-Task PyTorch Library for Continual Learning
Xiaotian Duan

TL;DR
HAT-CL is a user-friendly PyTorch library that simplifies implementing the HAT mechanism for continual learning, improving usability, compatibility, and performance with novel mask techniques.
Contribution
We developed HAT-CL, a PyTorch-compatible toolkit that streamlines HAT integration and introduces new mask manipulation methods for better continual learning.
Findings
HAT-CL improves ease of use and integration in existing architectures.
Novel mask techniques enhance continual learning performance.
HAT-CL demonstrates consistent improvements across experiments.
Abstract
Catastrophic forgetting, the phenomenon in which a neural network loses previously obtained knowledge during the learning of new tasks, poses a significant challenge in continual learning. The Hard-Attention-to-the-Task (HAT) mechanism has shown potential in mitigating this problem, but its practical implementation has been complicated by issues of usability and compatibility, and a lack of support for existing network reuse. In this paper, we introduce HAT-CL, a user-friendly, PyTorch-compatible redesign of the HAT mechanism. HAT-CL not only automates gradient manipulation but also streamlines the transformation of PyTorch modules into HAT modules. It achieves this by providing a comprehensive suite of modules that can be seamlessly integrated into existing architectures. Additionally, HAT-CL offers ready-to-use HAT networks that are smoothly integrated with the TIMM library. Beyond…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
