ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation
Bo Xu, Haotian Wu, Hehai Lin, Weiquan Huang, Beier Zhu, Yao Shu, Chengwei Qin

TL;DR
ACE-Merging introduces a data-free, theoretically grounded method for model merging that estimates input covariance from parameter differences, effectively reducing interference among task-specific models.
Contribution
It presents ACE-Merging, a novel adaptive covariance estimation framework with a closed-form solution that outperforms prior data-free merging methods on vision and language benchmarks.
Findings
ACE-Merging achieves a 4% average improvement over previous methods across seven tasks.
It provides a state-of-the-art data-free model merging technique for vision and language models.
The approach is computationally efficient due to its closed-form solution.
Abstract
Model merging aims to combine multiple task-specific expert models into a single model while preserving generalization across diverse tasks. However, interference among experts, especially when they are trained on different objectives, often leads to significant performance degradation. Despite recent progress, resolving this interference without data access, retraining, or architectural modification remains a fundamental challenge. This paper provides a theoretical analysis demonstrating that the input covariance of each task, which is a key factor for optimal merging, can be implicitly estimated from the parameter differences of its fine-tuned model, even in a fully data-free setting. Building on this insight, we introduce \acem, an Adaptive Covariance Estimation framework that effectively mitigates inter-task interference. Our approach features a principled, closed-form solution that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
