Large Pre-Training Datasets Don't Always Guarantee Robustness after Fine-Tuning

Jaedong Hwang; Brian Cheung; Zhang-Wei Hong; Akhilan Boopathy; Pulkit Agrawal; Ila Fiete

arXiv:2410.21582·cs.CV·September 29, 2025

Large Pre-Training Datasets Don't Always Guarantee Robustness after Fine-Tuning

Jaedong Hwang, Brian Cheung, Zhang-Wei Hong, Akhilan Boopathy, Pulkit Agrawal, Ila Fiete

PDF

Open Access

TL;DR

This paper investigates how fine-tuning large pretrained models affects their robustness, revealing that larger datasets do not always lead to better robustness preservation, and introduces a benchmark for systematic assessment.

Contribution

The paper introduces the Robustness Inheritance Benchmark (ImageNet-RIB) to evaluate robustness transfer during fine-tuning and demonstrates that larger pretraining datasets can lead to greater robustness loss.

Findings

01

Fine-tuning often reduces model robustness, even with continual learning methods.

02

Models pretrained on larger datasets like LAION-2B suffer more robustness loss.

03

Starting with the largest pretrained models may not be optimal for specialized tasks.

Abstract

Large-scale pretrained models are widely leveraged as foundations for learning new specialized tasks via fine-tuning, with the goal of maintaining the general performance of the model while allowing it to gain new skills. A valuable goal for all such models is robustness: the ability to perform well on out-of-distribution (OOD) tasks. We assess whether fine-tuning preserves the overall robustness of the pretrained model, and observed that models pretrained on large datasets exhibited strong catastrophic forgetting and loss of OOD generalization. To systematically assess robustness preservation in fine-tuned models, we propose the Robustness Inheritance Benchmark (ImageNet-RIB). The benchmark, which can be applied to any pretrained model, consists of a set of related but distinct OOD (downstream) tasks and involves fine-tuning on one of the OOD tasks in the set then testing on the rest.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced X-ray and CT Imaging · Radiomics and Machine Learning in Medical Imaging · Medical Imaging Techniques and Applications

MethodsElastic Weight Consolidation · Sparse Evolutionary Training