From Parameter to Representation: A Closed-Form Approach for Controllable Model Merging
Jialin Wu, Jian Yang, Handing Wang, Jiajun Wen, Zhiyong Yu

TL;DR
This paper introduces a novel closed-form method for controllable model merging that efficiently generates Pareto-optimal models by directly correcting representations, significantly reducing computational costs compared to previous iterative approaches.
Contribution
The authors propose a representation correction approach using a closed-form solution, replacing costly offline optimization with a single, scalable computation for multi-task model merging.
Findings
Produces a superior Pareto front with better preference alignment
Reduces computational complexity from exponential to linear in the number of tasks
Enables on-the-fly, preference-aware model generation
Abstract
Model merging combines expert models for multitask performance but faces challenges from parameter interference. This has sparked recent interest in controllable model merging, giving users the ability to explicitly balance performance trade-offs. Existing approaches employ a compile-then-query paradigm, performing a costly offline multi-objective optimization to enable fast, preference-aware model generation. This offline stage typically involves iterative search or dedicated training, with complexity that grows exponentially with the number of tasks. To overcome these limitations, we shift the perspective from parameter-space optimization to a direct correction of the model's final representation. Our approach models this correction as an optimal linear transformation, yielding a closed-form solution that replaces the entire offline optimization process with a single-step,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Multi-Objective Optimization Algorithms · Constraint Satisfaction and Optimization · Machine Learning and Data Classification
