FeatCal: Feature Calibration for Post-Merging Models

Yanggan Gu; Shuo Cai; Zihao Wang; Wenjun Wang; Yuanyi Wang; Pengkai Wang; Sirui Huang; Su Lu; Jianmin Wu; Hongxia Yang

arXiv:2605.13030·cs.LG·May 14, 2026

FeatCal: Feature Calibration for Post-Merging Models

Yanggan Gu, Shuo Cai, Zihao Wang, Wenjun Wang, Yuanyi Wang, Pengkai Wang, Sirui Huang, Su Lu, Jianmin Wu, Hongxia Yang

PDF

1 Repo

TL;DR

FeatCal is a novel feature calibration method for post-merging models that reduces feature drift and improves performance efficiently without iterative training, demonstrated on CLIP and GLUE benchmarks.

Contribution

It introduces a layer-by-layer weight calibration approach using a small set, with a closed-form solution, outperforming existing baselines in efficiency and accuracy.

Findings

01

FeatCal achieves higher accuracy than baselines on CLIP and GLUE benchmarks.

02

It requires fewer examples and less time for calibration, demonstrating better sample efficiency.

03

The method effectively reduces feature drift and output drift in merged models.

Abstract

Model merging combines task experts into one model and avoids joint training, retraining, or deploying many expert models, but the merged model often still underperforms task experts. We study this performance gap through feature drift, the difference between features produced by the merged model and by the expert on the same input. Our theory decomposes this drift into upstream propagation and local mismatch, tracks how it propagates and combines through later layers in forward order, and links final feature drift to output drift. This view motivates FeatCal, which uses a small calibration set to calibrate the merged model weights layer by layer in forward order, reducing feature drift while staying close to merged weights and preserving the benefits of model merging. FeatCal uses an efficient closed-form solution to update model weights, with no gradient descent, iterative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

egangu/featcal
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.