Transfer Learning via Test-Time Neural Networks Aggregation
Bruno Casella, Alessio Barbaro Chisari, Sebastiano Battiato, Mario, Valerio Giuffrida

TL;DR
This paper introduces a novel test-time neural network aggregation method for transfer learning that avoids additional training, mitigates catastrophic forgetting, and enables selective forgetting through simple arithmetic operations.
Contribution
It proposes a unified framework for model aggregation at test time, eliminating the need for extra training and addressing catastrophic forgetting.
Findings
Achieves comparable performance to baseline models.
Enables model aggregation without additional training.
Allows for selective forgetting if the aggregation operator has an inverse.
Abstract
It has been demonstrated that deep neural networks outperform traditional machine learning. However, deep networks lack generalisability, that is, they will not perform as good as in a new (testing) set drawn from a different distribution due to the domain shift. In order to tackle this known issue, several transfer learning approaches have been proposed, where the knowledge of a trained model is transferred into another to improve performance with different data. However, most of these approaches require additional training steps, or they suffer from catastrophic forgetting that occurs when a trained model has overwritten previously learnt knowledge. We address both problems with a novel transfer learning approach that uses network aggregation. We train dataset-specific networks together with an aggregation network in a unified framework. The loss function includes two main components:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest
