Domain-Aware Fine-Tuning: Enhancing Neural Network Adaptability
Seokhyeon Ha, Sunbeom Jung, Jungwoo Lee

TL;DR
This paper introduces Domain-Aware Fine-Tuning (DAFT), a novel method that improves neural network adaptation by converting batch normalization layers and combining linear probing with fine-tuning, reducing feature distortion and enhancing performance.
Contribution
The paper proposes DAFT, a new fine-tuning approach that effectively mitigates feature distortion through batch normalization conversion and integrated linear probing, outperforming existing methods.
Findings
DAFT reduces feature distortion during fine-tuning.
DAFT improves performance on in-distribution and out-of-distribution datasets.
Extensive experiments show DAFT outperforms baseline methods.
Abstract
Fine-tuning pre-trained neural network models has become a widely adopted approach across various domains. However, it can lead to the distortion of pre-trained feature extractors that already possess strong generalization capabilities. Mitigating feature distortion during adaptation to new target domains is crucial. Recent studies have shown promising results in handling feature distortion by aligning the head layer on in-distribution datasets before performing fine-tuning. Nonetheless, a significant limitation arises from the treatment of batch normalization layers during fine-tuning, leading to suboptimal performance. In this paper, we propose Domain-Aware Fine-Tuning (DAFT), a novel approach that incorporates batch normalization conversion and the integration of linear probing and fine-tuning. Our batch normalization conversion method effectively mitigates feature distortion by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Machine Learning and Data Classification
MethodsBatch Normalization
