Anticipate, Ensemble and Prune: Improving Convolutional Neural Networks   via Aggregated Early Exits

Simone Sarti; Eugenio Lomurno; Matteo Matteucci

arXiv:2301.12168·cs.LG·January 31, 2023

Anticipate, Ensemble and Prune: Improving Convolutional Neural Networks via Aggregated Early Exits

Simone Sarti, Eugenio Lomurno, Matteo Matteucci

PDF

Open Access

TL;DR

This paper introduces AEP, a training method that leverages weighted ensembles of early exits in neural networks to improve accuracy, reduce parameters, and lower inference latency, especially useful for edge computing.

Contribution

The paper proposes a novel training technique called AEP that exploits early exit structures to enhance neural network performance and efficiency.

Findings

01

Up to 15% accuracy improvement over traditional training.

02

Parameter reduction of up to 41%.

03

Inference latency decreased by 16%.

Abstract

Today, artificial neural networks are the state of the art for solving a variety of complex tasks, especially in image classification. Such architectures consist of a sequence of stacked layers with the aim of extracting useful information and having it processed by a classifier to make accurate predictions. However, intermediate information within such models is often left unused. In other cases, such as in edge computing contexts, these architectures are divided into multiple partitions that are made functional by including early exits, i.e. intermediate classifiers, with the goal of reducing the computational and temporal load without extremely compromising the accuracy of the classifications. In this paper, we present Anticipate, Ensemble and Prune (AEP), a new training technique based on weighted ensembles of early exits, which aims at exploiting the information in the structure of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Advanced Neural Network Applications · Machine Learning and ELM

MethodsPruning