An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and Calibration

Hiroki Naganuma; Ryuichiro Hataya; Kotaro Yoshida; Ioannis Mitliagkas

arXiv:2307.08187 · cs.LG · April 29, 2025 · 2 cites

TL;DR

This study empirically evaluates how pre-trained model size, pre-training dataset size, and training strategies influence out-of-distribution (OOD) generalization and calibration, and shows that model selection is critical for robust performance.

Contribution

The paper systematically analyzes how pre-trained model characteristics affect OOD performance and calibration, and shows that model selection can matter more than algorithm improvements.

Findings

01. Larger pre-trained models and larger pre-training datasets improve OOD accuracy and calibration.

02. Optimal pre-trained model choices significantly outperform algorithm-focused methods.

03. Modern deep networks can have better calibration than shallow models, contrary to prior beliefs.

Abstract

In the field of computer vision, fine-tuning pre-trained models has become a prevalent strategy for out-of-distribution (OOD) generalization tasks. Different from most prior work that has focused on advancing learning algorithms, we systematically examined how pre-trained model size, pre-training dataset size, and training strategies impact generalization and confidence calibration on downstream tasks. We evaluated 100 models across diverse pre-trained model sizes, five pre-training datasets, and five data augmentations through extensive experiments on four distribution shift datasets totaling over 120,000 GPU hours. Our results demonstrate the significant impact of pre-trained model selection, with optimal choices substantially improving OOD accuracy over algorithm improvement alone. Additionally, we find that larger models and bigger pre-training datasets not only enhance OOD…
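The abstract refers to confidence calibration without, in this excerpt, naming the metric used. As a minimal sketch, assuming the widely used expected calibration error (ECE) with equal-width confidence bins, calibration on a held-out OOD split could be measured as follows. The function name, the bin count, and the toy inputs are illustrative assumptions, not taken from the paper or its repository.

```python
from typing import Sequence


def expected_calibration_error(
    confidences: Sequence[float],
    predictions: Sequence[int],
    labels: Sequence[int],
    n_bins: int = 10,  # assumed default; the excerpt does not state a bin count
) -> float:
    """Sketch of ECE: weighted mean gap between accuracy and confidence per bin."""
    n = len(labels)
    if n == 0 or not (len(confidences) == len(predictions) == n):
        raise ValueError("inputs must be non-empty and of equal length")

    conf_sum = [0.0] * n_bins   # sum of confidences falling in each bin
    correct = [0] * n_bins      # number of correct predictions in each bin
    count = [0] * n_bins        # number of samples in each bin

    for c, p, y in zip(confidences, predictions, labels):
        # Equal-width bins [k/n_bins, (k+1)/n_bins); confidence 1.0 goes to the last bin.
        idx = min(int(c * n_bins), n_bins - 1)
        conf_sum[idx] += c
        correct[idx] += int(p == y)
        count[idx] += 1

    ece = 0.0
    for b in range(n_bins):
        if count[b] == 0:
            continue  # empty bins contribute nothing
        accuracy = correct[b] / count[b]
        mean_conf = conf_sum[b] / count[b]
        ece += (count[b] / n) * abs(accuracy - mean_conf)
    return ece


# Toy usage with made-up values: max-softmax confidences, predicted and true classes.
toy_ece = expected_calibration_error(
    confidences=[0.95, 0.95, 0.55, 0.55],
    predictions=[1, 0, 1, 1],
    labels=[1, 1, 1, 0],
)
```

A lower value means that predicted confidence tracks observed accuracy more closely. Comparing this number across fine-tuned checkpoints is one way to read the calibration comparison the abstract describes, under the assumption that ECE is the metric of interest.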

Code & Models

Repositories

hiroki11x/timm_ood_calibration (PyTorch, official)

Taxonomy

Topics: Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications
