Non-binary deep transfer learning for image classification
Jo Plested, Xuyang Shen, and Tom Gedeon

TL;DR
This paper introduces a non-binary transfer learning approach for image classification that optimizes hyperparameters and regularization strategies, leading to improved performance over traditional binary methods.
Contribution
It proposes a non-binary transfer learning framework with empirical methods for hyperparameter tuning, surpassing traditional binary transfer learning approaches.
Findings
Achieves near or surpasses state-of-the-art results on challenging tasks.
Demonstrates the importance of hyperparameter tuning in transfer learning.
Shows benefits of combining L2SP and L2 regularization.
Abstract
The current standard for a variety of computer vision tasks using smaller numbers of labelled training examples is to fine-tune from weights pre-trained on a large image classification dataset such as ImageNet. The application of transfer learning and transfer learning methods tends to be rigidly binary. A model is either pre-trained or not pre-trained. Pre-training a model either increases performance or decreases it, the latter being defined as negative transfer. Application of L2-SP regularisation that decays the weights towards their pre-trained values is either applied or all weights are decayed towards 0. This paper re-examines these assumptions. Our recommendations are based on extensive empirical evaluation that demonstrate the application of a non-binary approach to achieve optimal results. (1) Achieving best performance on each individual dataset requires careful adjustment of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications
