Upweighting Easy Samples in Fine-Tuning Mitigates Forgetting

Sunny Sanyal; Hayden Prairie; Rudrajit Das; Ali Kavis; Sujay Sanghavi

arXiv:2502.02797·cs.LG·June 13, 2025

Upweighting Easy Samples in Fine-Tuning Mitigates Forgetting

Sunny Sanyal, Hayden Prairie, Rudrajit Das, Ali Kavis, Sujay Sanghavi

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a sample weighting scheme based on pre-trained model losses to mitigate catastrophic forgetting during fine-tuning, effectively preserving original capabilities while adapting to new tasks.

Contribution

It proposes a novel sample weighting method that upweights easy samples based on pre-trained model losses, addressing forgetting without access to original training data.

Findings

01

Reduces accuracy drop on target task during fine-tuning.

02

Preserves more pre-training knowledge compared to standard fine-tuning.

03

Effective on both language and vision tasks.

Abstract

Fine-tuning a pre-trained model on a downstream task often degrades its original capabilities, a phenomenon known as "catastrophic forgetting". This is especially an issue when one does not have access to the data and recipe used to develop the pre-trained model. Under this constraint, most existing methods for mitigating forgetting are inapplicable. To address this challenge, we propose a sample weighting scheme for the fine-tuning data solely based on the pre-trained model's losses. Specifically, we upweight the easy samples on which the pre-trained model's loss is low and vice versa to limit the drift from the pre-trained model. Our approach is orthogonal and yet complementary to existing methods; while such methods mostly operate on parameter or gradient space, we concentrate on the sample space. We theoretically analyze the impact of fine-tuning with our method in a linear setting,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sanyalsunny111/flow_finetuning
pytorchOfficial

Videos

Upweighting Easy Samples in Fine-Tuning Mitigates Forgetting· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning