RegMean++: Enhancing Effectiveness and Generalization of Regression Mean for Model Merging

The-Hai Nguyen; Dang Huu-Tien; Takeshi Suzuki; and Le-Minh Nguyen

arXiv:2508.03121·cs.LG·April 28, 2026

RegMean++: Enhancing Effectiveness and Generalization of Regression Mean for Model Merging

The-Hai Nguyen, Dang Huu-Tien, Takeshi Suzuki, and Le-Minh Nguyen

PDF

1 Repo

TL;DR

RegMean++ improves model merging by explicitly modeling layer dependencies, leading to better performance and generalization across various tasks and distribution shifts.

Contribution

It introduces a new method, RegMean++, that incorporates intra-layer and cross-layer dependencies into model merging, surpassing the original RegMean.

Findings

01

RegMean++ outperforms RegMean in diverse settings.

02

It achieves better generalization in ID and OOD scenarios.

03

RegMean++ is robust under distribution shifts.

Abstract

Regression Mean (RegMean), an approach that formulates model merging as a linear regression problem, aims to find the optimal weights for each linear layer in the merged model by minimizing the discrepancy in predictions between the merged and candidate models. RegMean provides a precise closed-form solution for the merging problem; therefore, it offers explainability and computational efficiency. However, RegMean merges each linear layer independently, overlooking how the features and information in earlier layers propagate through deeper layers and influence the final predictions of the merged model. Here, we introduce RegMean++, a simple yet effective alternative to RegMean, that explicitly incorporates both intra-layer and cross-layer dependencies between merged models' layers into RegMean's objective. By accounting for these dependencies, RegMean++ better captures the behaviors of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nthehai01/RegMean-plusplus
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.