BD-Merging: Bias-Aware Dynamic Model Merging with Evidence-Guided Contrastive Learning
Yuhan Xie, Chen Lyu

TL;DR
BD-Merging introduces a bias-aware, uncertainty modeling framework for model merging in multi-task learning, improving robustness and reliability under distribution shifts through evidence-guided contrastive learning.
Contribution
It proposes a novel bias-aware unsupervised model merging method that explicitly models uncertainty and uses evidence-guided contrastive learning to enhance robustness under distribution shifts.
Findings
Outperforms state-of-the-art MM methods in diverse tasks.
Effectively mitigates bias and improves generalization under distribution shift.
Achieves superior robustness and effectiveness in empirical evaluations.
Abstract
Model Merging (MM) has emerged as a scalable paradigm for multi-task learning (MTL), enabling multiple task-specific models to be integrated without revisiting the original training data. Despite recent progress, the reliability of MM under test-time distribution shift remains insufficiently understood. Most existing MM methods typically assume that test data are clean and distributionally aligned with both the training and auxiliary sources. However, this assumption rarely holds in practice, often resulting in biased predictions with degraded generalization. To address this issue, we present BD-Merging, a bias-aware unsupervised model merging framework that explicitly models uncertainty to achieve adaptive reliability under distribution shift. First, BD-Merging introduces a joint evidential head that learns uncertainty over a unified label space, capturing cross-task semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Advanced Graph Neural Networks
