HAT-CL: A Hard-Attention-to-the-Task PyTorch Library for Continual Learning

Xiaotian Duan

arXiv:2307.09653·cs.LG·February 6, 2024·2 cites

TL;DR

HAT-CL is a user-friendly PyTorch library that simplifies implementing the HAT mechanism for continual learning, improving usability, compatibility, and performance with novel mask techniques.

Contribution

We developed HAT-CL, a PyTorch-compatible toolkit that streamlines HAT integration and introduces new mask manipulation methods for better continual learning.

Findings

01. HAT-CL improves ease of use and integration in existing architectures.

02. Novel mask techniques enhance continual learning performance.

03. HAT-CL demonstrates consistent improvements across experiments.

Abstract

Catastrophic forgetting, the phenomenon in which a neural network loses previously obtained knowledge during the learning of new tasks, poses a significant challenge in continual learning. The Hard-Attention-to-the-Task (HAT) mechanism has shown potential in mitigating this problem, but its practical implementation has been complicated by issues of usability and compatibility, and a lack of support for existing network reuse. In this paper, we introduce HAT-CL, a user-friendly, PyTorch-compatible redesign of the HAT mechanism. HAT-CL not only automates gradient manipulation but also streamlines the transformation of PyTorch modules into HAT modules. It achieves this by providing a comprehensive suite of modules that can be seamlessly integrated into existing architectures. Additionally, HAT-CL offers ready-to-use HAT networks that are smoothly integrated with the TIMM library. Beyond…
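The gradient manipulation that the abstract says HAT-CL automates can be illustrated with a minimal sketch. It follows the general HAT formulation (a scaled-sigmoid mask over a per-task embedding, with gradients attenuated wherever earlier tasks claimed a unit) and does not use the HAT-CL API. All function names are hypothetical, and plain Python lists stand in for tensors.

```python
import math


def _sigmoid(x):
    # Numerically stable sigmoid, safe for very large |x|.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def hard_attention_mask(task_embedding, scale):
    """Per-unit mask sigmoid(scale * e); a large scale pushes values toward 0 or 1."""
    return [_sigmoid(scale * e) for e in task_embedding]


def accumulate_mask(cumulative, new_mask):
    """Element-wise max: a unit stays protected once any earlier task has used it."""
    return [max(c, m) for c, m in zip(cumulative, new_mask)]


def scale_gradient(grad, prev_out_mask, prev_in_mask):
    """Attenuate grad[i][j] (weight from input unit j to output unit i)
    by 1 - min(prev_out_mask[i], prev_in_mask[j])."""
    return [
        [(1.0 - min(prev_out_mask[i], prev_in_mask[j])) * g
         for j, g in enumerate(row)]
        for i, row in enumerate(grad)
    ]


# Illustrative values only (not taken from the paper).
mask_task1 = hard_attention_mask([2.0, -2.0], scale=50.0)
cumulative = accumulate_mask([0.0, 0.0], mask_task1)
protected = scale_gradient([[1.0, 1.0], [1.0, 1.0]], [1.0, 0.0], [1.0, 1.0])
```

In this toy case the first output unit was fully claimed by an earlier task, so its incoming weights receive no gradient, while the second unit remains free to learn. A library such as HAT-CL would apply this kind of scaling automatically during the backward pass rather than by hand.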

Code & Models

Repositories

xduan7/hat-cl (PyTorch, official)

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications