BD-Merging: Bias-Aware Dynamic Model Merging with Evidence-Guided Contrastive Learning

Yuhan Xie; Chen Lyu

arXiv:2603.03920·cs.LG·March 12, 2026

BD-Merging: Bias-Aware Dynamic Model Merging with Evidence-Guided Contrastive Learning

Yuhan Xie, Chen Lyu

PDF

Open Access

TL;DR

BD-Merging introduces a bias-aware, uncertainty modeling framework for model merging in multi-task learning, improving robustness and reliability under distribution shifts through evidence-guided contrastive learning.

Contribution

It proposes a novel bias-aware unsupervised model merging method that explicitly models uncertainty and uses evidence-guided contrastive learning to enhance robustness under distribution shifts.

Findings

01

Outperforms state-of-the-art MM methods in diverse tasks.

02

Effectively mitigates bias and improves generalization under distribution shift.

03

Achieves superior robustness and effectiveness in empirical evaluations.

Abstract

Model Merging (MM) has emerged as a scalable paradigm for multi-task learning (MTL), enabling multiple task-specific models to be integrated without revisiting the original training data. Despite recent progress, the reliability of MM under test-time distribution shift remains insufficiently understood. Most existing MM methods typically assume that test data are clean and distributionally aligned with both the training and auxiliary sources. However, this assumption rarely holds in practice, often resulting in biased predictions with degraded generalization. To address this issue, we present BD-Merging, a bias-aware unsupervised model merging framework that explicitly models uncertainty to achieve adaptive reliability under distribution shift. First, BD-Merging introduces a joint evidential head that learns uncertainty over a unified label space, capturing cross-task semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Advanced Graph Neural Networks