The Expanding Scope of the Stability Gap: Unveiling its Presence in   Joint Incremental Learning of Homogeneous Tasks

Sandesh Kamath; Albin Soutif-Cormerais; Joost van de Weijer; Bogdan; Raducanu

arXiv:2406.05114·cs.LG·June 10, 2024

The Expanding Scope of the Stability Gap: Unveiling its Presence in Joint Incremental Learning of Homogeneous Tasks

Sandesh Kamath, Albin Soutif-Cormerais, Joost van de Weijer, Bogdan, Raducanu

PDF

Open Access

TL;DR

This paper reveals that the stability gap, previously observed in continual learning, also occurs in joint incremental training of homogeneous tasks, highlighting challenges in optimization paths and performance stability.

Contribution

It demonstrates the presence of the stability gap in joint incremental learning and analyzes the optimization dynamics, offering insights into potential solutions.

Findings

01

Stability gap occurs in joint incremental training of homogeneous tasks.

02

A low-loss linear path exists but SGD does not follow it.

03

Batch-wise analysis provides insights into optimization challenges.

Abstract

Recent research identified a temporary performance drop on previously learned tasks when transitioning to a new one. This drop is called the stability gap and has great consequences for continual learning: it complicates the direct employment of continually learning since the worse-case performance at task-boundaries is dramatic, it limits its potential as an energy-efficient training paradigm, and finally, the stability drop could result in a reduced final performance of the algorithm. In this paper, we show that the stability gap also occurs when applying joint incremental training of homogeneous tasks. In this scenario, the learner continues training on the same data distribution and has access to all data from previous tasks. In addition, we show that in this scenario, there exists a low-loss linear path to the next minima, but that SGD optimization does not choose this path. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInnovative Teaching and Learning Methods · Intelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics

MethodsStochastic Gradient Descent