Task agnostic continual learning with Pairwise layer architecture

Santtu Keskinen

arXiv:2405.13632·cs.LG·May 24, 2024

Task agnostic continual learning with Pairwise layer architecture

Santtu Keskinen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a task-agnostic continual learning method using a pairwise interaction layer that enhances performance without relying on memory replay or task boundary information.

Contribution

It proposes a novel static architecture with a pairwise interaction layer that improves continual learning performance without task-specific mechanisms.

Findings

01

Achieves competitive results on MNIST and FashionMNIST.

02

Operates effectively in online streaming scenarios without task labels.

03

Does not require memory replay or task boundary detection.

Abstract

Most of the dominant approaches to continual learning are based on either memory replay, parameter isolation, or regularization techniques that require task boundaries to calculate task statistics. We propose a static architecture-based method that doesn't use any of these. We show that we can improve the continual learning performance by replacing the final layer of our networks with our pairwise interaction layer. The pairwise interaction layer uses sparse representations from a Winner-take-all style activation function to find the relevant correlations in the hidden layer representations. The networks using this architecture show competitive performance in MNIST and FashionMNIST-based continual image classification experiments. We demonstrate this in an online streaming continual learning setup where the learning system cannot access task labels or boundaries.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

skeskinen/pairwise_online_learning
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning