Fisher-Orthogonal Projected Natural Gradient Descent for Continual Learning
Ishir Garg, Neel Kolhe, Andy Peng, Rohan Gopalam

TL;DR
This paper introduces FOPNG, a novel optimizer for continual learning that uses Fisher-orthogonal constraints to prevent forgetting old tasks while learning new ones, unifying natural gradient and orthogonal gradient methods.
Contribution
The paper proposes FOPNG, a new optimizer that enforces Fisher-orthogonal constraints on parameter updates, combining natural gradient descent with orthogonal gradient methods for continual learning.
Findings
FOPNG outperforms existing methods on standard benchmarks.
The approach effectively prevents catastrophic forgetting.
Efficient implementation using diagonal Fisher is demonstrated.
Abstract
Continual learning aims to enable neural networks to acquire new knowledge on sequential tasks. However, the key challenge in such settings is to learn new tasks without catastrophically forgetting previously learned tasks. We propose the Fisher-Orthogonal Projected Natural Gradient Descent (FOPNG) optimizer, which enforces Fisher-orthogonal constraints on parameter updates to preserve old task performance while learning new tasks. Unlike existing methods that operate in Euclidean parameter space, FOPNG projects gradients onto the Fisher-orthogonal complement of previous task gradients. This approach unifies natural gradient descent with orthogonal gradient methods within an information-geometric framework. We provide theoretical analysis deriving the projected update, describe efficient and practical implementations using the diagonal Fisher, and demonstrate strong results on standard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques · Face recognition and analysis
