ARM: Role-Conditioned Neuron Transplantation for Training-Free Generalist LLM Agent Merging

Zhuoka Feng; Kang Chen; Sihan Zhao; Kai Xiong; Yaoning Wang; Minshen Yu; Junjie Nian; Changyi Xiao; Yixin Cao; Yugang Jiang

arXiv:2601.07309·cs.AI·January 13, 2026

ARM: Role-Conditioned Neuron Transplantation for Training-Free Generalist LLM Agent Merging

Zhuoka Feng, Kang Chen, Sihan Zhao, Kai Xiong, Yaoning Wang, Minshen Yu, Junjie Nian, Changyi Xiao, Yixin Cao, Yugang Jiang

PDF

Open Access

TL;DR

ARM introduces a training-free neuron transplantation method for merging multiple large language model experts, enhancing multi-environment adaptability and out-of-domain generalization without gradient updates.

Contribution

It presents a novel role-conditioned neuron transplantation framework that improves model merging for interactive LLM agents across diverse environments.

Findings

01

Outperforms prior merging methods and domain-specific models

02

Enhances cross-benchmark and out-of-domain generalization

03

Operates efficiently without gradient-based optimization

Abstract

Interactive large language model agents have advanced rapidly, but most remain specialized to a single environment and fail to adapt robustly to other environments. Model merging offers a training-free alternative by integrating multiple experts into a single model. In this paper, we propose Agent-Role Merging (ARM), an activation-guided, role-conditioned neuron transplantation method for model merging in LLM agents. ARM improves existing merging methods from static natural language tasks to multi-turn agent scenarios, and over the generalization ability across various interactive environments. This is achieved with a well designed 3-step framework: 1) constructing merged backbones, 2) selection based on its role-conditioned activation analysis, and 3) neuron transplantation for fine-grained refinements. Without gradient-based optimization, ARM improves cross-benchmark generalization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning