Structured Ensembles: an Approach to Reduce the Memory Footprint of   Ensemble Methods

Jary Pomponi; Simone Scardapane; and Aurelio Uncini

arXiv:2105.02551·cs.LG·October 7, 2022

Structured Ensembles: an Approach to Reduce the Memory Footprint of Ensemble Methods

Jary Pomponi, Simone Scardapane, and Aurelio Uncini

PDF

2 Repos

TL;DR

This paper introduces Structured Ensembles, a novel method to extract diverse sub-networks from a single untrained neural network, significantly reducing memory requirements while maintaining or improving accuracy and uncertainty calibration.

Contribution

The paper presents a new end-to-end optimization approach for creating memory-efficient neural network ensembles by extracting sub-structures, with applications to continual learning.

Findings

01

Achieves comparable or higher accuracy than existing methods.

02

Requires significantly less memory for ensemble storage.

03

Performs well in uncertainty estimation and continual learning scenarios.

Abstract

In this paper, we propose a novel ensembling technique for deep neural networks, which is able to drastically reduce the required memory compared to alternative approaches. In particular, we propose to extract multiple sub-networks from a single, untrained neural network by solving an end-to-end optimization task combining differentiable scaling over the original architecture, with multiple regularization terms favouring the diversity of the ensemble. Since our proposal aims to detect and extract sub-structures, we call it Structured Ensemble. On a large experimental evaluation, we show that our method can achieve higher or comparable accuracy to competing methods while requiring significantly less storage. In addition, we evaluate our ensembles in terms of predictive calibration and uncertainty, showing they compare favourably with the state-of-the-art. Finally, we draw a link with the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.