AutoMerge: Search-Based Model Merging Framework for Effective Model Reuse

You Lu; Jiyang Zhang; Bihuan Chen; Chaofeng Sha; Dingji Wang; Xin Peng

arXiv:2601.22748·cs.SE·February 2, 2026

AutoMerge: Search-Based Model Merging Framework for Effective Model Reuse

You Lu, Jiyang Zhang, Bihuan Chen, Chaofeng Sha, Dingji Wang, Xin Peng

PDF

Open Access

TL;DR

AutoMerge introduces a systematic, search-based framework for model merging that effectively handles diverse architectures and domains, overcoming limitations of existing techniques and enhancing model reuse.

Contribution

This paper presents AutoMerge, the first search-based model merging framework that adapts to heterogeneous models across domains, improving upon existing methods' limitations.

Findings

01

Existing merging techniques are inconsistent across models and domains.

02

A single merging method cannot effectively handle heterogeneous model structures.

03

Hyperparameter sensitivity limits the broader applicability of current merging techniques.

Abstract

Software reuse has long been recognized as a critical and widely studied topic in software engineering, offering substantial benefits in reducing development costs, improving software quality, and enhancing operational efficiency. This paradigm extends into deep learning through model reuse. Recently, model merging has emerged in the domain of large language models (LLMs) as a training-free approach that takes multiple task-specific models with the same architecture as source models and merges them without retraining, enhancing model reuse within LLMs. However, no prior work has systematically investigated whether such an approach can be effectively applied to other deep learning models with different architectures across domains. To bridge this gap, we present the first systematic study that evaluates five model merging techniques on three distinct model architectures across three…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Software Engineering Research