How transferable are features in deep neural networks?

Jason Yosinski; Jeff Clune; Yoshua Bengio; Hod Lipson

arXiv:1411.1792 · cs.LG · December 9, 2014 · 3.5k cites

TL;DR

This paper measures how well features learned by deep neural networks transfer between tasks. Transferability depends on which layer the features come from and on how similar the base and target tasks are, yet even features transferred from distant tasks can outperform random features.

Contribution

The study quantifies the transferability of features in each network layer and uncovers unexpected effects of transfer on generalization and optimization.

Findings

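01 Transferability decreases as the base and target tasks become more dissimilar.

02 Initializing a network with features transferred from almost any number of layers can improve generalization, even after fine-tuning on the target dataset.

03 Features transferred from distant tasks can still outperform random features.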

Abstract

Many deep neural networks trained on natural images exhibit a curious phenomenon in common: on the first layer they learn features similar to Gabor filters and color blobs. Such first-layer features appear not to be specific to a particular dataset or task, but general in that they are applicable to many datasets and tasks. Features must eventually transition from general to specific by the last layer of the network, but this transition has not been studied extensively. In this paper we experimentally quantify the generality versus specificity of neurons in each layer of a deep convolutional neural network and report a few surprising results. Transferability is negatively affected by two distinct issues: (1) the specialization of higher layer neurons to their original task at the expense of performance on the target task, which was expected, and (2) optimization difficulties related to…
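The layer-by-layer measurement described in the abstract can be pictured as follows: copy the first n layers of a network trained on a base task into a network for the target task, then train on the target task. The block below is a minimal, framework-free sketch of that setup, not the authors' code. `Layer`, `transfer_first_n_layers`, and the `fine_tune` flag are hypothetical names introduced for illustration, and the choice between freezing and fine-tuning the copied layers is an assumption about how such an experiment is typically configured.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Layer:
    """Hypothetical stand-in for one network layer."""
    weights: List[float]    # placeholder for a real weight tensor
    trainable: bool = True  # whether training on the target task may update it


def transfer_first_n_layers(base: List[Layer], target: List[Layer],
                            n: int, fine_tune: bool = False) -> List[Layer]:
    """Build a network whose first n layers are copied from `base`.

    Copied layers are frozen unless `fine_tune` is True. The remaining
    layers keep the target network's (e.g. randomly initialized) weights
    and stay trainable.
    """
    if len(base) != len(target):
        raise ValueError("base and target must have the same number of layers")
    if not 0 <= n <= len(base):
        raise ValueError("n must be between 0 and the number of layers")

    new_net = []
    for i in range(len(target)):
        if i < n:
            # Transferred layer: weights come from the base-task network.
            new_net.append(Layer(list(base[i].weights), trainable=fine_tune))
        else:
            # Upper layer: keep the target network's weights, train as usual.
            new_net.append(Layer(list(target[i].weights), trainable=True))
    return new_net


# Toy three-layer networks; the numbers are arbitrary placeholders.
base = [Layer([1.0, 1.0]), Layer([2.0, 2.0]), Layer([3.0, 3.0])]
target = [Layer([0.1, 0.1]), Layer([0.2, 0.2]), Layer([0.3, 0.3])]

frozen = transfer_first_n_layers(base, target, n=2)
tuned = transfer_first_n_layers(base, target, n=2, fine_tune=True)
```

Sweeping n from 0 to the network depth and comparing target-task accuracy at each value is one way to see where features stop being general and start being task-specific.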

Code & Models

Repositories

Models

🤗 HWresearch/GNN4Colliders (model)

Taxonomy

Topics: Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning