Distance-Based Regularisation of Deep Networks for Fine-Tuning

Henry Gouk; Timothy M. Hospedales; Massimiliano Pontil

arXiv:2002.08253 · stat.ML · January 18, 2021 · 6 cites

TL;DR

This paper introduces a distance-based regularisation method for fine-tuning deep neural networks, providing theoretical generalisation bounds and demonstrating improved empirical performance over existing methods.

Contribution

The paper proposes a regularisation approach that limits how far the weights can move from their pre-trained values during fine-tuning, supported by a theoretical generalisation bound and by empirical results.

Findings

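01 · The generalisation bound depends on the distance the weights have moved from their initial values, has no direct dependence on the number of weights, and compares favourably to other bounds when applied to convolutional networks.

02 · Empirical results show improved fine-tuning performance.

03 · The method outperforms state-of-the-art fine-tuning techniques.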

Abstract

We investigate approaches to regularisation during fine-tuning of deep neural networks. First we provide a neural network generalisation bound based on Rademacher complexity that uses the distance the weights have moved from their initial values. This bound has no direct dependence on the number of weights and compares favourably to other bounds when applied to convolutional networks. Our bound is highly relevant for fine-tuning, because providing a network with a good initialisation based on transfer learning means that learning can modify the weights less, and hence achieve tighter generalisation. Inspired by this, we develop a simple yet effective fine-tuning algorithm that constrains the hypothesis class to a small sphere centred on the initial pre-trained weights, thus obtaining provably better generalisation performance than conventional transfer learning. Empirical evaluation…
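The idea of constraining fine-tuning to a small sphere centred on the pre-trained weights can be illustrated with projected gradient descent. The sketch below is not the authors' implementation: it assumes a Frobenius-norm ball, since this page does not say which norm the paper uses, and the names `project_to_ball`, `fine_tune_step`, and `radius` are hypothetical. The toy regression problem and random "pre-trained" matrix are only there to make the example runnable.

```python
import numpy as np


def project_to_ball(weights, init_weights, radius):
    """Project `weights` onto the Frobenius-norm ball of the given
    radius centred on `init_weights` (assumed norm, see lead-in)."""
    delta = weights - init_weights
    dist = np.linalg.norm(delta)  # Frobenius norm for a 2-D array
    if dist <= radius:
        return weights  # already inside the ball, nothing to do
    # Rescale the displacement so it lies exactly on the ball's surface.
    return init_weights + delta * (radius / dist)


def fine_tune_step(weights, init_weights, grad, lr, radius):
    """One projected-gradient step: ordinary update, then projection."""
    return project_to_ball(weights - lr * grad, init_weights, radius)


# Toy usage: linear least squares with a stand-in "pre-trained" matrix.
rng = np.random.default_rng(0)
X = rng.normal(size=(32, 4))
Y = rng.normal(size=(32, 3))
W0 = rng.normal(size=(4, 3))  # hypothetical pre-trained weights
W = W0.copy()
radius = 0.5                  # hypothetical hyperparameter

for _ in range(100):
    # Gradient of the squared error averaged over the 32 samples.
    grad = 2.0 * X.T @ (X @ W - Y) / len(X)
    W = fine_tune_step(W, W0, grad, lr=0.05, radius=radius)

# The fine-tuned weights never leave the ball around W0.
print(np.linalg.norm(W - W0) <= radius + 1e-9)
```

Projecting after every step keeps each iterate inside the constraint set, so the distance from the initial weights is bounded by `radius` by construction rather than encouraged by a penalty term. In a deep network the same projection would typically be applied per layer, with one radius per layer or a shared one; that choice is an assumption here, not something stated on this page.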

Code & Models

Repositories

henrygouk/mars-finetuning · TensorFlow · Official

Videos

Distance-Based Regularisation of Deep Networks for Fine-Tuning · SlidesLive

Taxonomy

Topics: Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Sparse and Compressive Sensing Techniques