Deep Model Merging: The Sister of Neural Network Interpretability -- A Survey

Arham Khan; Todd Nief; Nathaniel Hudson; Mansi Sakarvadia; Daniel Grzenda; Aswathy Ajith; Jordan Pettyjohn; Kyle Chard; Ian Foster

arXiv:2410.12927 · cs.LG · March 25, 2025

Open Access

TL;DR

This survey explores how loss landscape geometry influences neural network training, interpretability, and robustness, highlighting key characteristics like mode convexity and connectivity, and proposing future research directions.

Contribution

It synthesizes empirical findings on model merging and loss landscapes, connecting them to interpretability and robustness, and suggests new research avenues.

Findings

01

Identifies four key loss landscape characteristics: mode convexity, determinism, directedness, connectivity.

02

Links model merging insights to interpretability and robustness.

03

Proposes new research directions at the intersection of these fields.

Abstract

We survey the model merging literature through the lens of loss landscape geometry, connecting observations from empirical studies of model merging and loss landscape analysis to the phenomena that govern neural network training and the emergence of inner representations. We distill repeated empirical observations from the literature in these fields into descriptions of four major characteristics of loss landscape geometry: mode convexity, determinism, directedness, and connectivity. We argue that insights into the structure of learned representations from model merging have applications to model interpretability and robustness, and we propose promising new research directions at the intersection of these fields.

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Remote Sensing and LiDAR Applications