What is being transferred in transfer learning?

Behnam Neyshabur; Hanie Sedghi; Chiyuan Zhang

arXiv:2008.11687·cs.LG·January 18, 2021·195 cites

What is being transferred in transfer learning?

Behnam Neyshabur, Hanie Sedghi, Chiyuan Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates the mechanisms behind transfer learning in deep neural networks, distinguishing between feature reuse and learning data statistics, and analyzing the effects of pre-training on model behavior.

Contribution

It introduces new analytical tools to understand what aspects of models are transferred and how pre-trained weights influence model stability and similarity.

Findings

01

Transfer learning benefits partly from learning data statistics.

02

Pre-trained models tend to stay in the same loss landscape basin.

03

Models initialized with pre-trained weights are similar in feature and parameter space.

Abstract

One desired capability for machines is the ability to transfer their knowledge of one domain to another where data is (usually) scarce. Despite ample adaptation of transfer learning in various deep learning applications, we yet do not understand what enables a successful transfer and which part of the network is responsible for that. In this paper, we provide new tools and analyses to address these fundamental questions. Through a series of analyses on transferring to block-shuffled images, we separate the effect of feature reuse from learning low-level statistics of data and show that some benefit of transfer learning comes from the latter. We present that when training from pre-trained weights, the model stays in the same basin in the loss landscape and different instances of such model are similar in feature space and close in parameter space.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google-research/understanding-transfer-learning
pytorchOfficial

Videos

What is being transferred in transfer learning?· slideslive

Taxonomy

TopicsInterpreting and Communication in Healthcare · Topic Modeling · Domain Adaptation and Few-Shot Learning