Multifaceted Analysis of Fine-Tuning in Deep Model for Visual Recognition
Xiangyang Li, Luis Herranz, and Shuqiang Jiang

TL;DR
This paper systematically investigates various factors affecting the performance of fine-tuning CNNs for visual recognition, providing empirical insights into how to optimize the process for better transfer learning outcomes.
Contribution
It introduces a comprehensive analysis of multiple factors influencing fine-tuning effectiveness in CNNs for visual recognition tasks, offering practical guidelines based on empirical evidence.
Findings
Fine-tuning parameters significantly impact performance.
Source-target data similarity affects transfer effectiveness.
Empirical guidelines for optimal fine-tuning strategies.
Abstract
In recent years, convolutional neural networks (CNNs) have achieved impressive performance for various visual recognition scenarios. CNNs trained on large labeled datasets can not only obtain significant performance on most challenging benchmarks but also provide powerful representations, which can be used to a wide range of other tasks. However, the requirement of massive amounts of data to train deep neural networks is a major drawback of these models, as the data available is usually limited or imbalanced. Fine-tuning (FT) is an effective way to transfer knowledge learned in a source dataset to a target task. In this paper, we introduce and systematically investigate several factors that influence the performance of fine-tuning for visual recognition. These factors include parameters for the retraining procedure (e.g., the initial learning rate of fine-tuning), the distribution of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
