Rethinking the Bias of Foundation Model under Long-tailed Distribution

Jiahao Chen; Bin Qin; Jiangmeng Li; Hao Chen; Bing Su

arXiv:2501.15955·cs.LG·August 11, 2025

Rethinking the Bias of Foundation Model under Long-tailed Distribution

Jiahao Chen, Bin Qin, Jiangmeng Li, Hao Chen, Bing Su

PDF

Open Access

TL;DR

This paper investigates how inherent biases from pre-training data affect foundation models in long-tailed tasks, revealing parameter imbalance as a key issue and proposing a causal learning-based backdoor adjustment method to improve performance.

Contribution

It identifies the dominant role of parameter imbalance inherited from pre-training and introduces a novel causal learning approach with backdoor adjustment to mitigate these biases.

Findings

01

Parameter imbalance is more critical than data imbalance during fine-tuning.

02

Current re-balancing methods cannot effectively address parameter imbalance.

03

Proposed causal learning method improves performance by approximately 1.67% across datasets.

Abstract

Long-tailed learning has garnered increasing attention due to its practical significance. Among the various approaches, the fine-tuning paradigm has gained considerable interest with the advent of foundation models. However, most existing methods primarily focus on leveraging knowledge from these models, overlooking the inherent biases introduced by the imbalanced training data they rely on. In this paper, we examine how such imbalances from pre-training affect long-tailed downstream tasks. Specifically, we find the imbalance biases inherited in foundation models on downstream task as parameter imbalance and data imbalance. During fine-tuning, we observe that parameter imbalance plays a more critical role, while data imbalance can be mitigated using existing re-balancing strategies. Moreover, we find that parameter imbalance cannot be effectively addressed by current re-balancing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDam Engineering and Safety · Rough Sets and Fuzzy Logic · Reservoir Engineering and Simulation Methods

MethodsSoftmax · Attention Is All You Need · Focus