Gated Transfer Network for Transfer Learning
Yi Zhu, Jia Xue, Shawn Newsam

TL;DR
This paper introduces a Gated Transfer Network that enhances transfer learning by selectively weighting features from pre-trained models, leading to improved performance across multiple tasks with minimal additional computation.
Contribution
The paper proposes a novel transfer module for feature selection in fine-tuning, significantly boosting transfer learning effectiveness and introducing the Gated Transfer Network architecture.
Findings
Achieved state-of-the-art results on six tasks.
Feature selection via the transfer module improves transfer learning.
Incorporating an auxiliary classifier reduces overfitting.
Abstract
Deep neural networks have led to a series of breakthroughs in computer vision given sufficient annotated training datasets. For novel tasks with limited labeled data, the prevalent approach is to transfer the knowledge learned in the pre-trained models to the new tasks by fine-tuning. Classic model fine-tuning utilizes the fact that well trained neural networks appear to learn cross domain features. These features are treated equally during transfer learning. In this paper, we explore the impact of feature selection in model fine-tuning by introducing a transfer module, which assigns weights to features extracted from pre-trained models. The proposed transfer module proves the importance of feature selection for transferring models from source to target domains. It is shown to significantly improve upon fine-tuning results with only marginal extra computational cost. We also incorporate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · COVID-19 diagnosis using AI
