ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation

Bo Xu; Haotian Wu; Hehai Lin; Weiquan Huang; Beier Zhu; Yao Shu; Chengwei Qin

arXiv:2603.02945·cs.CL·April 9, 2026

ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation

Bo Xu, Haotian Wu, Hehai Lin, Weiquan Huang, Beier Zhu, Yao Shu, Chengwei Qin

PDF

TL;DR

ACE-Merging introduces a data-free, theoretically grounded method for model merging that estimates input covariance from parameter differences, effectively reducing interference among task-specific models.

Contribution

It presents ACE-Merging, a novel adaptive covariance estimation framework with a closed-form solution that outperforms prior data-free merging methods on vision and language benchmarks.

Findings

01

ACE-Merging achieves a 4% average improvement over previous methods across seven tasks.

02

It provides a state-of-the-art data-free model merging technique for vision and language models.

03

The approach is computationally efficient due to its closed-form solution.

Abstract

Model merging aims to combine multiple task-specific expert models into a single model while preserving generalization across diverse tasks. However, interference among experts, especially when they are trained on different objectives, often leads to significant performance degradation. Despite recent progress, resolving this interference without data access, retraining, or architectural modification remains a fundamental challenge. This paper provides a theoretical analysis demonstrating that the input covariance of each task, which is a key factor for optimal merging, can be implicitly estimated from the parameter differences of its fine-tuned model, even in a fully data-free setting. Building on this insight, we introduce \acem, an Adaptive Covariance Estimation framework that effectively mitigates inter-task interference. Our approach features a principled, closed-form solution that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.