Combating catastrophic forgetting with developmental compression
Shawn L.E. Beaulieu, Sam Kriegman, Josh C. Bongard

TL;DR
This paper introduces developmental compression, a scalable method to mitigate catastrophic forgetting in neural networks by gradually compressing specialized networks into a generalized model, validated on robot control tasks.
Contribution
It presents a novel developmental compression technique that reduces catastrophic forgetting without requiring domain knowledge, improving scalability over existing methods.
Findings
Reduces catastrophic forgetting in neural networks.
Produces generalized models without overt specialization.
Validated on robot control problem.
Abstract
Generally intelligent agents exhibit successful behavior across problems in several settings. Endemic in approaches to realize such intelligence in machines is catastrophic forgetting: sequential learning corrupts knowledge obtained earlier in the sequence, or tasks antagonistically compete for system resources. Methods for obviating catastrophic forgetting have sought to identify and preserve features of the system necessary to solve one problem when learning to solve another, or to enforce modularity such that minimally overlapping sub-functions contain task specific knowledge. While successful, both approaches scale poorly because they require larger architectures as the number of training instances grows, causing different parts of the system to specialize for separate subsets of the data. Here we present a method for addressing catastrophic forgetting called developmental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
