DR-Tune: Improving Fine-tuning of Pretrained Visual Models by Distribution Regularization with Semantic Calibration
Nan Zhou, Jiaxin Chen, Di Huang

TL;DR
DR-Tune is a novel fine-tuning framework for pretrained visual models that employs distribution regularization and semantic calibration to prevent overfitting and address semantic drift, leading to improved downstream task performance.
Contribution
It introduces distribution regularization combined with semantic calibration to enhance fine-tuning of pretrained visual models, a novel approach in this domain.
Findings
Consistently improves image classification accuracy across datasets.
Effectively prevents overfitting during fine-tuning.
Addresses semantic drift to maintain feature alignment.
Abstract
The visual models pretrained on large-scale benchmarks encode general knowledge and prove effective in building more powerful representations for downstream tasks. Most existing approaches follow the fine-tuning paradigm, either by initializing or regularizing the downstream model based on the pretrained one. The former fails to retain the knowledge in the successive fine-tuning phase, thereby prone to be over-fitting, and the latter imposes strong constraints to the weights or feature maps of the downstream model without considering semantic drift, often incurring insufficient optimization. To deal with these issues, we propose a novel fine-tuning framework, namely distribution regularization with semantic calibration (DR-Tune). It employs distribution regularization by enforcing the downstream task head to decrease its classification error on the pretrained feature distribution, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
MethodsALIGN
