Investigating the Impact of Weight Sharing Decisions on Knowledge Transfer in Continual Learning
Josh Andle, Ali Payani, Salimeh Yasaei-Sekeh

TL;DR
This paper explores how different weight sharing strategies in continual learning affect forward knowledge transfer, demonstrating that task complexity and similarity influence optimal sharing decisions to improve accuracy.
Contribution
It provides an analysis of weight sharing decisions in continual learning, revealing how task characteristics guide optimal sharing for better transfer and accuracy.
Findings
Task complexity and similarity influence optimal weight sharing.
Sharing decisions based on findings improve task accuracy.
Structured subnetworks facilitate better knowledge transfer.
Abstract
Continual Learning (CL) has generated attention as a method of avoiding Catastrophic Forgetting (CF) in the sequential training of neural networks, improving network efficiency and adaptability to different tasks. Additionally, CL serves as an ideal setting for studying network behavior and Forward Knowledge Transfer (FKT) between tasks. Pruning methods for CL train subnetworks to handle the sequential tasks which allows us to take a structured approach to investigating FKT. Sharing prior subnetworks' weights leverages past knowledge for the current task through FKT. Understanding which weights to share is important as sharing all weights can yield sub-optimal accuracy. This paper investigates how different sharing decisions affect the FKT between tasks. Through this lens we demonstrate how task complexity and similarity influence the optimal weight sharing decisions, giving insights…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsPruning
