TL;DR
This paper introduces partial fusion of neural networks, a method that interpolates between ensembles and weight aggregation to balance computational cost and accuracy, using neuron similarity and optimal transport.
Contribution
It proposes a novel partial fusion technique that combines neurons based on similarity, enabling flexible tradeoffs between ensemble accuracy and weight aggregation efficiency.
Findings
Partial fusion achieves a better tradeoff between performance and computational cost.
Neuron similarity-based aggregation improves model efficiency.
Generalized pruning with partial fusion yields similar benefits to ensemble methods.
Abstract
Ensembles of neural networks typically outperform individual networks but incur large computational costs, whereas weight aggregation produces less costly, yet also less accurate, aggregate models. We introduce partial fusion of networks, which interpolates between ensembles and weight aggregation and thus allows for a flexible tradeoff between computational cost and performance. A direct way to achieve this is to extend existing weight aggregation methods based on neuron-level similarity between different networks, where partial fusion then only aggregates weights of neurons which are most similar. We showcase one particular method to jointly identify which neurons are most similar and match them via partial optimal transport. Further, we consider the more general perspective of weight aggregation and partial fusion as generalized pruning of ensemble models, where neurons cannot just…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
