Representation Surgery for Multi-Task Model Merging
Enneng Yang, Li Shen, Zhenyi Wang, Guibing Guo, Xiaojun, Chen, Xingwei Wang, Dacheng Tao

TL;DR
This paper introduces a lightweight 'Surgery' module that reduces representation bias in merged multi-task models, significantly improving their performance by aligning representations of individual and merged models.
Contribution
The paper proposes a novel representation surgery method to address representation bias in model merging for multi-task learning, enhancing performance without retraining from raw data.
Findings
Significant performance improvements on state-of-the-art model merging schemes.
Effective reduction of representation bias through the Surgery module.
Unsupervised optimization successfully aligns representations of merged and individual models.
Abstract
Multi-task learning (MTL) compresses the information from multiple tasks into a unified backbone to improve computational efficiency and generalization. Recent work directly merges multiple independently trained models to perform MTL instead of collecting their raw data for joint training, greatly expanding the application scenarios of MTL. However, by visualizing the representation distribution of existing model merging schemes, we find that the merged model often suffers from the dilemma of representation bias. That is, there is a significant discrepancy in the representation distribution between the merged and individual models, resulting in poor performance of merged MTL. In this paper, we propose a representation surgery solution called "Surgery" to reduce representation bias in the merged model. Specifically, Surgery is a lightweight task-specific module that takes the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnatomy and Medical Technology
