Network Fission Ensembles for Low-Cost Self-Ensembles
Hojung Lee, Jong-Seok Lee

TL;DR
Network Fission Ensembles (NFE) transforms a single neural network into a multi-exit structure through pruning and grouping, enabling low-cost ensemble inference without additional models, improving accuracy efficiently.
Contribution
The paper introduces Network Fission Ensembles, a novel method that creates multi-exit networks from a single model for cost-effective ensemble learning.
Findings
Achieves significant accuracy improvements over existing ensemble methods.
No additional computational burden during inference due to multi-exit structure.
Enhances performance through regularization from multiple exit losses.
Abstract
Recent ensemble learning methods for image classification have been shown to improve classification accuracy with low extra cost. However, they still require multiple trained models for ensemble inference, which eventually becomes a significant burden when the model size increases. In this paper, we propose a low-cost ensemble learning and inference, called Network Fission Ensembles (NFE), by converting a conventional network itself into a multi-exit structure. Starting from a given initial network, we first prune some of the weights to reduce the training burden. We then group the remaining weights into several sets and create multiple auxiliary paths using each set to construct multi-exits. We call this process Network Fission. Through this, multiple outputs can be obtained from a single network, which enables ensemble learning. Since this process simply changes the existing network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence · Molecular Communication and Nanonetworks · Distributed Control Multi-Agent Systems
MethodsSparse Evolutionary Training
