Improving Reliability of Fine-tuning with Block-wise Optimisation

Basel Barakat; Qiang Huang

arXiv:2301.06133·cs.LG·January 18, 2023·1 cites

Improving Reliability of Fine-tuning with Block-wise Optimisation

Basel Barakat, Qiang Huang

PDF

Open Access

TL;DR

This paper introduces a block-wise optimization method for fine-tuning pre-trained models, which adapt groups of layers to improve domain-specific task performance and reduce overfitting.

Contribution

It proposes a novel block-wise fine-tuning mechanism with four layer grouping strategies, enhancing adaptation effectiveness over traditional methods.

Findings

01

Outperforms baseline fine-tuning methods on Tf_flower dataset.

02

Effective in selecting salient layer groups for improved accuracy.

03

Reduces overfitting compared to full-layer re-optimization.

Abstract

Finetuning can be used to tackle domain-specific tasks by transferring knowledge. Previous studies on finetuning focused on adapting only the weights of a task-specific classifier or re-optimizing all layers of the pre-trained model using the new task data. The first type of methods cannot mitigate the mismatch between a pre-trained model and the new task data, and the second type of methods easily cause over-fitting when processing tasks with limited data. To explore the effectiveness of fine-tuning, we propose a novel block-wise optimization mechanism, which adapts the weights of a group of layers of a pre-trained model. In our work, the layer selection can be done in four different ways. The first is layer-wise adaptation, which aims to search for the most salient single layer according to the classification performance. The second way is based on the first one, jointly adapting a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsContext-Aware Activity Recognition Systems · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications