AutoMerge: Search-Based Model Merging Framework for Effective Model Reuse
You Lu, Jiyang Zhang, Bihuan Chen, Chaofeng Sha, Dingji Wang, Xin Peng

TL;DR
AutoMerge introduces a systematic, search-based framework for model merging that effectively handles diverse architectures and domains, overcoming limitations of existing techniques and enhancing model reuse.
Contribution
This paper presents AutoMerge, the first search-based model merging framework that adapts to heterogeneous models across domains, improving upon existing methods' limitations.
Findings
Existing merging techniques are inconsistent across models and domains.
A single merging method cannot effectively handle heterogeneous model structures.
Hyperparameter sensitivity limits the broader applicability of current merging techniques.
Abstract
Software reuse has long been recognized as a critical and widely studied topic in software engineering, offering substantial benefits in reducing development costs, improving software quality, and enhancing operational efficiency. This paradigm extends into deep learning through model reuse. Recently, model merging has emerged in the domain of large language models (LLMs) as a training-free approach that takes multiple task-specific models with the same architecture as source models and merges them without retraining, enhancing model reuse within LLMs. However, no prior work has systematically investigated whether such an approach can be effectively applied to other deep learning models with different architectures across domains. To bridge this gap, we present the first systematic study that evaluates five model merging techniques on three distinct model architectures across three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Software Engineering Research
