Activation-Guided Consensus Merging for Large Language Models

Yuxuan Yao; Shuqi Liu; Zehua Liu; Qintong Li; Mingyang Liu; Xiongwei Han; Zhijiang Guo; Han Wu; Linqi Song

arXiv:2505.14009 · cs.CL · November 17, 2025

TL;DR

This paper introduces Activation-Guided Consensus Merging (ACM), a novel, efficient layer-specific model merging method for large language models that preserves capabilities and improves performance without additional training.

Contribution

ACM is a new plug-and-play framework that determines layer importance using mutual information, addressing heterogeneity in neural components during model merging.

Findings

01

ACM outperforms baseline merging methods across a range of tasks.

02

In Qwen-7B models, ACM reduces response length by 55.3%.

03

ACM improves reasoning accuracy by 1.3 points.

Abstract

Recent research has increasingly focused on reconciling the reasoning capabilities of System 2 with the efficiency of System 1. While existing training-based and prompt-based approaches face significant challenges in terms of efficiency and stability, model merging emerges as a promising strategy to integrate the diverse capabilities of different Large Language Models (LLMs) into a unified model. However, conventional model merging methods often assume uniform importance across layers, overlooking the functional heterogeneity inherent in neural components. To address this limitation, we propose Activation-Guided Consensus Merging (ACM), a plug-and-play merging framework that determines layer-specific merging coefficients based on mutual information between activations of pre-trained and fine-tuned models. ACM effectively preserves task-specific…
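
The abstract describes layer-specific merging coefficients derived from the mutual information (MI) between activations of the pre-trained and fine-tuned models. The following is a minimal sketch of that idea, not the authors' implementation. Everything specific in it is an assumption made for illustration: the histogram-based MI estimator, the mapping `1 - tanh(t * MI)` from MI to a coefficient, the task-vector style merge, and the names `mutual_information`, `layer_coefficient`, `merge_layerwise`, `bins`, and `t`. It also handles a single fine-tuned model, whereas the paper targets merging several.

```python
import numpy as np

def mutual_information(x, y, bins=16):
    """Histogram estimate of MI (in nats) between two 1-D activation samples."""
    joint, _, _ = np.histogram2d(x, y, bins=bins)
    pxy = joint / joint.sum()                  # joint distribution over bins
    px = pxy.sum(axis=1, keepdims=True)        # marginal of x, shape (bins, 1)
    py = pxy.sum(axis=0, keepdims=True)        # marginal of y, shape (1, bins)
    mask = pxy > 0                             # skip empty cells to avoid log(0)
    mi = np.sum(pxy[mask] * np.log(pxy[mask] / (px @ py)[mask]))
    return max(float(mi), 0.0)                 # clamp tiny negative rounding error

def layer_coefficient(mi, t=1.0):
    """Assumed mapping: high MI (similar activations) gives a small coefficient."""
    return 1.0 - float(np.tanh(t * mi))

def merge_layerwise(pre, tuned, acts_pre, acts_tuned, t=1.0):
    """Merge each layer as pre + lambda_l * (tuned - pre) with a per-layer lambda."""
    merged, coeffs = {}, {}
    for name in pre:
        mi = mutual_information(acts_pre[name].ravel(), acts_tuned[name].ravel())
        lam = layer_coefficient(mi, t)
        coeffs[name] = lam
        merged[name] = pre[name] + lam * (tuned[name] - pre[name])
    return merged, coeffs

# Toy example: two "layers" with random weights and synthetic activations.
rng = np.random.default_rng(0)
pre = {"layer0": rng.normal(size=(4, 4)), "layer1": rng.normal(size=(4, 4))}
tuned = {k: v + 0.1 * rng.normal(size=v.shape) for k, v in pre.items()}
a = rng.normal(size=4096)
acts_pre = {"layer0": a, "layer1": a}
acts_tuned = {"layer0": a.copy(),               # unchanged activations: high MI
              "layer1": rng.normal(size=4096)}  # unrelated activations: low MI
merged, coeffs = merge_layerwise(pre, tuned, acts_pre, acts_tuned)
print(coeffs)
```

In this toy setup the layer whose activations are unchanged receives a coefficient close to 0 and stays near the pre-trained weights, while the layer whose activations diverged receives a coefficient close to 1 and takes most of the fine-tuned update. Whether this direction and functional form match the paper should be checked against the full text.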

Taxonomy

Topics: Topic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)