MeGA: Merging Multiple Independently Trained Neural Networks Based on   Genetic Algorithm

Daniel Yun

arXiv:2406.04607·cs.NE·July 1, 2024

MeGA: Merging Multiple Independently Trained Neural Networks Based on Genetic Algorithm

Daniel Yun

PDF

Open Access 1 Repo

TL;DR

This paper presents MeGA, a genetic algorithm-based method for merging multiple pre-trained neural networks, which enhances model accuracy and robustness by optimally combining weights.

Contribution

It introduces a novel genetic algorithm approach for effectively merging independently trained neural networks, outperforming traditional averaging and ensemble techniques.

Findings

01

Improved test accuracy on CIFAR-10 compared to individual models.

02

Enhanced robustness of merged models.

03

Scalable method for integrating multiple pre-trained networks.

Abstract

In this paper, we introduce a novel method for merging the weights of multiple pre-trained neural networks using a genetic algorithm called MeGA. Traditional techniques, such as weight averaging and ensemble methods, often fail to fully harness the capabilities of pre-trained networks. Our approach leverages a genetic algorithm with tournament selection, crossover, and mutation to optimize weight combinations, creating a more effective fusion. This technique allows the merged model to inherit advantageous features from both parent models, resulting in enhanced accuracy and robustness. Through experiments on the CIFAR-10 dataset, we demonstrate that our genetic algorithm-based weight merging method improves test accuracy compared to individual models and conventional methods. This approach provides a scalable solution for integrating multiple pre-trained networks across various deep…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yunblak/mega-merging-multiple-independently-trained-neural-networks-based-on-genetic-algorithm
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications