Explicit Inductive Bias for Transfer Learning with Convolutional   Networks

Xuhong Li; Yves Grandvalet; Franck Davoine

arXiv:1802.01483·cs.LG·June 7, 2018·100 cites

Explicit Inductive Bias for Transfer Learning with Convolutional Networks

Xuhong Li, Yves Grandvalet, Franck Davoine

PDF

Open Access 3 Repos

TL;DR

This paper explores regularization techniques that explicitly encourage similarity to pre-trained models during transfer learning with convolutional networks, improving fine-tuning effectiveness.

Contribution

It introduces and evaluates regularization schemes, especially an $L^2$ penalty, to explicitly incorporate inductive bias towards the pre-trained model in transfer learning.

Findings

01

Explicit bias improves transfer learning performance

02

L2 penalty with pre-trained model as reference is effective

03

Regularization schemes outperform standard fine-tuning methods

Abstract

In inductive transfer learning, fine-tuning pre-trained convolutional networks substantially outperforms training from scratch. When using fine-tuning, the underlying assumption is that the pre-trained model extracts generic features, which are at least partially relevant for solving the target task, but would be difficult to extract from the limited amount of data available on the target task. However, besides the initialization with the pre-trained model and the early stopping, there is no mechanism in fine-tuning for retaining the features learned on the source task. In this paper, we investigate several regularization schemes that explicitly promote the similarity of the final solution with the initial model. We show the benefit of having an explicit inductive bias towards the initial model, and we eventually recommend a simple $L^{2}$ penalty with the pre-trained model being a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Face recognition and analysis · Machine Learning and ELM