Superpose Task-specific Features for Model Merging

Haiquan Qiu; You Wu; Dong Li; Jianmin Guo; Quanming Yao

arXiv:2502.10698·cs.LG·September 19, 2025

TL;DR

Motivated by the linear representation hypothesis, this paper introduces a model merging method that superposes task-specific features from individual models into a single merged model, preserving multi-task capabilities without additional training.

Contribution

The paper formulates merging as a linear system over the linear transformation matrices of the individual models, so that task-specific features are preserved and multi-task performance improves over existing methods.

Findings

01. Outperforms existing model merging techniques on diverse benchmarks
02. Effectively preserves multi-task capabilities in merged models
03. Demonstrates robustness across different neural network architectures

Abstract

Model merging enables powerful capabilities in neural networks without requiring additional training. In this paper, we introduce a novel perspective on model merging by leveraging the fundamental mechanisms of neural network representation. Our approach is motivated by the linear representation hypothesis, which states that neural networks encode information through linear combinations of feature vectors. We propose a method that superposes task-specific features from individual models into a merged model. Our approach specifically targets linear transformation matrices, which are crucial for feature activation and extraction in deep networks. By formulating the merging process as a linear system, we can preserve task-specific features from individual models and create merged models that maintain multi-task capabilities more effectively than existing methods. Extensive experiments…
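The abstract does not spell out the linear system, so the following is only a sketch of one plausible reading: for a single linear layer, look for a merged matrix W whose response on each task's feature directions matches that task model's response, i.e. W F_i ≈ W_i F_i for every task i, solved by least squares. The function name `merge_linear_layers`, the feature matrices, the toy shapes, and the use of `numpy.linalg.lstsq` are all illustrative assumptions, not the paper's algorithm; in particular, how the task-specific features are obtained is not stated here, so random directions stand in for them.

```python
import numpy as np


def merge_linear_layers(weights, features):
    """Solve W @ F_i ~= W_i @ F_i for all tasks i in the least-squares sense.

    weights:  list of (d_out, d_in) arrays, one linear layer per task model.
    features: list of (d_in, k_i) arrays; the columns of features[i] are
              ASSUMED to be the input directions that matter for task i.
    """
    # Stack every task's feature directions side by side: (d_in, sum_i k_i).
    stacked_features = np.concatenate(features, axis=1)
    # Responses each task model produces on its own features: (d_out, sum_i k_i).
    stacked_targets = np.concatenate(
        [w @ f for w, f in zip(weights, features)], axis=1
    )
    # W @ F = T is equivalent to F.T @ W.T = T.T, the form lstsq expects.
    merged_t, *_ = np.linalg.lstsq(stacked_features.T, stacked_targets.T, rcond=None)
    return merged_t.T


# Toy data (hypothetical): two task models sharing one 4x6 layer shape.
rng = np.random.default_rng(0)
w1, w2 = rng.normal(size=(4, 6)), rng.normal(size=(4, 6))
f1, f2 = rng.normal(size=(6, 2)), rng.normal(size=(6, 2))

merged = merge_linear_layers([w1, w2], [f1, f2])
averaged = (w1 + w2) / 2  # plain weight averaging, for comparison

# Largest deviation from each task model's response on its own features.
pairs = [(w1, f1), (w2, f2)]
merged_err = max(np.abs(merged @ f - w @ f).max() for w, f in pairs)
averaged_err = max(np.abs(averaged @ f - w @ f).max() for w, f in pairs)
print(f"merged error: {merged_err:.2e}, averaged error: {averaged_err:.2e}")
```

In this toy setup the stacked system has fewer feature directions than input dimensions, so every constraint can be met exactly, whereas plain averaging generally cannot meet them. With more feature directions than input dimensions, the same solve becomes a least-squares compromise between tasks.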

Taxonomy

Topics: Model-Driven Software Engineering Techniques · Semantic Web and Ontologies · 3D Modeling in Geospatial Applications