Network Fission Ensembles for Low-Cost Self-Ensembles

Hojung Lee; Jong-Seok Lee

arXiv:2408.02301·cs.CV·January 30, 2025

Network Fission Ensembles for Low-Cost Self-Ensembles

Hojung Lee, Jong-Seok Lee

PDF

Open Access 1 Repo

TL;DR

Network Fission Ensembles (NFE) transforms a single neural network into a multi-exit structure through pruning and grouping, enabling low-cost ensemble inference without additional models, improving accuracy efficiently.

Contribution

The paper introduces Network Fission Ensembles, a novel method that creates multi-exit networks from a single model for cost-effective ensemble learning.

Findings

01

Achieves significant accuracy improvements over existing ensemble methods.

02

No additional computational burden during inference due to multi-exit structure.

03

Enhances performance through regularization from multiple exit losses.

Abstract

Recent ensemble learning methods for image classification have been shown to improve classification accuracy with low extra cost. However, they still require multiple trained models for ensemble inference, which eventually becomes a significant burden when the model size increases. In this paper, we propose a low-cost ensemble learning and inference, called Network Fission Ensembles (NFE), by converting a conventional network itself into a multi-exit structure. Starting from a given initial network, we first prune some of the weights to reduce the training burden. We then group the remaining weights into several sets and create multiple auxiliary paths using each set to construct multi-exits. We call this process Network Fission. Through this, multiple outputs can be obtained from a single network, which enables ensemble learning. Since this process simply changes the existing network…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hjdw2/nfe
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModular Robots and Swarm Intelligence · Molecular Communication and Nanonetworks · Distributed Control Multi-Agent Systems

MethodsSparse Evolutionary Training