On Learning the Geodesic Path for Incremental Learning

Christian Simon; Piotr Koniusz; Mehrtash Harandi

arXiv:2104.08572·cs.LG·April 20, 2021

On Learning the Geodesic Path for Incremental Learning

Christian Simon, Piotr Koniusz, Mehrtash Harandi

PDF

1 Repo

TL;DR

This paper introduces a novel incremental learning method that constructs low-dimensional manifolds for responses and minimizes dissimilarity along their geodesic, effectively reducing catastrophic forgetting.

Contribution

It proposes a new knowledge distillation technique using geodesic paths between response manifolds, improving retention of past knowledge.

Findings

01

More effective preservation of previous knowledge

02

Smooth response transitions along geodesic paths

03

Empirical results show improved incremental learning performance

Abstract

Neural networks notoriously suffer from the problem of catastrophic forgetting, the phenomenon of forgetting the past knowledge when acquiring new knowledge. Overcoming catastrophic forgetting is of significant importance to emulate the process of "incremental learning", where the model is capable of learning from sequential experience in an efficient and robust way. State-of-the-art techniques for incremental learning make use of knowledge distillation towards preventing catastrophic forgetting. Therein, one updates the network while ensuring that the network's responses to previously seen concepts remain stable throughout updates. This in practice is done by minimizing the dissimilarity between current and previous responses of the network one way or another. Our work contributes a novel method to the arsenal of distillation techniques. In contrast to the previous state of the art, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chrysts/geodesic_continual_learning
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsKnowledge Distillation