Model Fusion via Neuron Transplantation

Muhammed Öz; Nicholas Kiefer; Charlotte Debus; Jasmin Hörter; Achim Streit; Markus Götz

arXiv:2502.06849·cs.LG·February 12, 2025

TL;DR

This paper introduces Neuron Transplantation, a novel model fusion method that combines ensemble models by transplanting important neurons, reducing memory and inference costs while maintaining or improving performance.

Contribution

The paper presents a new neuron transplantation technique for model fusion that outperforms traditional ensemble methods in efficiency and often in accuracy.

Findings

01. Neuron Transplantation outperforms individual models after fine-tuning.

02. NT requires less fine-tuning and memory than OT-fusion.

03. The fused models achieve comparable or better performance.

Abstract

Ensemble learning is a widespread technique to improve the prediction performance of neural networks. However, it comes at the price of increased memory and inference time. In this work, we propose a novel model fusion technique called Neuron Transplantation (NT) in which we fuse an ensemble of models by transplanting important neurons from all ensemble members into the vacant space obtained by pruning insignificant neurons. An initial loss in performance post-transplantation can be quickly recovered via fine-tuning, consistently outperforming individual ensemble members of the same model capacity and architecture. Furthermore, NT enables all the ensemble members to be jointly pruned and jointly trained in a combined model. Comparing it to alignment-based averaging (like Optimal-Transport-fusion), it requires less fine-tuning than the corresponding OT-fused model, the fusion…
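The fusion step described in the abstract can be illustrated with a minimal sketch for networks with a single hidden layer. This is not the authors' implementation: the function name `transplant_hidden_layer`, the tuple layout `(W1, b1, W2, b2)`, and the importance score (L2 norm of each neuron's incoming weights and bias) are assumptions chosen for illustration, and the fine-tuning that the abstract says recovers the initial performance loss is omitted.

```python
import numpy as np

def transplant_hidden_layer(models, keep):
    """Fuse one-hidden-layer MLPs into a single model with `keep` hidden neurons.

    Hypothetical layout per model: (W1, b1, W2, b2) with shapes
    W1: (h, d), b1: (h,), W2: (o, h), b2: (o,).
    """
    k = len(models)
    # Stack all hidden neurons side by side. Dividing the output weights by k
    # and averaging the output biases makes the wide model compute the
    # ensemble's mean output before any pruning.
    W1 = np.concatenate([m[0] for m in models], axis=0)      # (k*h, d)
    b1 = np.concatenate([m[1] for m in models], axis=0)      # (k*h,)
    W2 = np.concatenate([m[2] for m in models], axis=1) / k  # (o, k*h)
    b2 = np.mean([m[3] for m in models], axis=0)             # (o,)

    # Assumed importance score: L2 norm of incoming weights plus bias.
    importance = np.sqrt((W1 ** 2).sum(axis=1) + b1 ** 2)

    # Keep the `keep` most important neurons across all members; the rest
    # are pruned. Sorting the indices preserves the original neuron order.
    top = np.sort(np.argsort(importance)[-keep:])
    return W1[top], b1[top], W2[:, top], b2

# Toy usage: two members with 2 hidden neurons each, fused back to 2 neurons.
model_a = (np.array([[10.0, 0.0, 0.0], [0.1, 0.0, 0.0]]), np.zeros(2),
           np.array([[1.0, 2.0]]), np.array([0.5]))
model_b = (np.array([[0.0, 5.0, 0.0], [0.0, 0.0, 0.01]]), np.zeros(2),
           np.array([[3.0, 4.0]]), np.array([1.5]))
W1, b1, W2, b2 = transplant_hidden_layer([model_a, model_b], keep=2)
```

In this toy case the fused model keeps one large-norm neuron from each member, so it has the same hidden width as a single member while drawing neurons from both. A deeper network would need the same selection applied layer by layer, with each layer's incoming weights restricted to the neurons kept in the previous layer.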

Code & Models

Repositories

masterbaer/neuron-transplantation
PyTorch · Official

Taxonomy

Methods: Pruning