Investigating the Impact of Weight Sharing Decisions on Knowledge   Transfer in Continual Learning

Josh Andle; Ali Payani; Salimeh Yasaei-Sekeh

arXiv:2311.09506·cs.LG·December 20, 2023·1 cites

Investigating the Impact of Weight Sharing Decisions on Knowledge Transfer in Continual Learning

Josh Andle, Ali Payani, Salimeh Yasaei-Sekeh

PDF

Open Access

TL;DR

This paper explores how different weight sharing strategies in continual learning affect forward knowledge transfer, demonstrating that task complexity and similarity influence optimal sharing decisions to improve accuracy.

Contribution

It provides an analysis of weight sharing decisions in continual learning, revealing how task characteristics guide optimal sharing for better transfer and accuracy.

Findings

01

Task complexity and similarity influence optimal weight sharing.

02

Sharing decisions based on findings improve task accuracy.

03

Structured subnetworks facilitate better knowledge transfer.

Abstract

Continual Learning (CL) has generated attention as a method of avoiding Catastrophic Forgetting (CF) in the sequential training of neural networks, improving network efficiency and adaptability to different tasks. Additionally, CL serves as an ideal setting for studying network behavior and Forward Knowledge Transfer (FKT) between tasks. Pruning methods for CL train subnetworks to handle the sequential tasks which allows us to take a structured approach to investigating FKT. Sharing prior subnetworks' weights leverages past knowledge for the current task through FKT. Understanding which weights to share is important as sharing all weights can yield sub-optimal accuracy. This paper investigates how different sharing decisions affect the FKT between tasks. Through this lens we demonstrate how task complexity and similarity influence the optimal weight sharing decisions, giving insights…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning

MethodsPruning