TL;DR
This paper introduces PackNet, a method that uses iterative pruning and re-training to sequentially add multiple tasks to a single neural network, significantly reducing catastrophic forgetting and storage needs.
Contribution
The paper proposes a novel iterative pruning approach to pack multiple tasks into one network without proxy losses, improving robustness against forgetting.
Findings
Achieved near-separate-task accuracy on multiple classification tasks
Demonstrated robustness against catastrophic forgetting
Validated on large-scale datasets and various architectures
Abstract
This paper presents a method for adding multiple tasks to a single deep neural network while avoiding catastrophic forgetting. Inspired by network pruning techniques, we exploit redundancies in large deep networks to free up parameters that can then be employed to learn new tasks. By performing iterative pruning and network re-training, we are able to sequentially "pack" multiple tasks into a single network while ensuring minimal drop in performance and minimal storage overhead. Unlike prior work that uses proxy losses to maintain accuracy on older tasks, we always optimize for the task at hand. We perform extensive experiments on a variety of network architectures and large-scale datasets, and observe much better robustness against catastrophic forgetting than prior work. In particular, we are able to add three fine-grained classification tasks to a single ImageNet-trained VGG-16…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning
