Gradual Tuning: a better way of Fine Tuning the parameters of a Deep   Neural Network

Guglielmo Montone; J. Kevin O'Regan; Alexander V. Terekhov

arXiv:1711.10177·cs.AI·November 29, 2017·5 cites

Gradual Tuning: a better way of Fine Tuning the parameters of a Deep Neural Network

Guglielmo Montone, J. Kevin O'Regan, Alexander V. Terekhov

PDF

Open Access

TL;DR

Gradual Tuning is a novel fine-tuning method that progressively adjusts more network parameters, reducing catastrophic forgetting and maintaining or improving performance on new tasks.

Contribution

The paper introduces Gradual Tuning, an alternative fine-tuning strategy that incrementally modifies network parameters to better preserve prior knowledge during transfer learning.

Findings

01

Reduces catastrophic forgetting compared to standard fine-tuning.

02

Maintains or improves performance on the new task.

03

Effective across different network sizes and regularization techniques.

Abstract

In this paper we present an alternative strategy for fine-tuning the parameters of a network. We named the technique Gradual Tuning. Once trained on a first task, the network is fine-tuned on a second task by modifying a progressively larger set of the network's parameters. We test Gradual Tuning on different transfer learning tasks, using networks of different sizes trained with different regularization techniques. The result shows that compared to the usual fine tuning, our approach significantly reduces catastrophic forgetting of the initial task, while still retaining comparable if not better performance on the new task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Seismology and Earthquake Studies