SurgeryV2: Bridging the Gap Between Model Merging and Multi-Task Learning with Deep Representation Surgery
Enneng Yang, Li Shen, Zhenyi Wang, Guibing Guo, Xingwei Wang, Xiaocun, Cao, Jie Zhang, Dacheng Tao

TL;DR
SurgeryV2 introduces a deep representation surgery method that aligns representations across all layers of merged models, significantly reducing bias and closing the performance gap with traditional multi-task learning.
Contribution
This paper proposes a novel deep representation surgery approach that mitigates layer-wise representation bias in model merging-based multi-task learning, achieving performance comparable to traditional methods.
Findings
SurgeryV2 nearly matches the performance of individual expert models.
Representation bias exists at all layers and impacts model merging effectiveness.
Aligning representations across all layers effectively reduces systemic bias.
Abstract
Model merging-based multitask learning (MTL) offers a promising approach for performing MTL by merging multiple expert models without requiring access to raw training data. However, in this paper, we examine the merged model's representation distribution and uncover a critical issue of "representation bias". This bias arises from a significant distribution gap between the representations of the merged and expert models, leading to the suboptimal performance of the merged MTL model. To address this challenge, we first propose a representation surgery solution called Surgery. Surgery is a lightweight, task-specific module that aligns the final layer representations of the merged model with those of the expert models, effectively alleviating bias and improving the merged model's performance. Despite these improvements, a performance gap remains compared to the traditional MTL method.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Colorectal Cancer Surgical Treatments · Anatomy and Medical Technology
