PEA: Improving the Performance of ReLU Networks for Free by Using   Progressive Ensemble Activations

\'Akos Utasi

arXiv:2207.14074·cs.CV·July 29, 2022

PEA: Improving the Performance of ReLU Networks for Free by Using Progressive Ensemble Activations

\'Akos Utasi

PDF

Open Access

TL;DR

This paper introduces a progressive ensemble activation method that combines ReLU with novel activations during training, ultimately simplifying the network to ReLU-only for inference while improving performance.

Contribution

It proposes a novel training approach where ensemble activations are progressively pruned, enhancing ReLU networks without increasing inference complexity.

Findings

01

Achieved 0.2-0.8% top-1 accuracy improvement on ImageNet.

02

Demonstrated 0.34% mIOU boost in semantic segmentation.

03

Validated effectiveness across various architectures and tasks.

Abstract

In recent years novel activation functions have been proposed to improve the performance of neural networks, and they show superior performance compared to the ReLU counterpart. However, there are environments, where the availability of complex activations is limited, and usually only the ReLU is supported. In this paper we propose methods that can be used to improve the performance of ReLU networks by using these efficient novel activations during model training. More specifically, we propose ensemble activations that are composed of the ReLU and one of these novel activations. Furthermore, the coefficients of the ensemble are neither fixed nor learned, but are progressively updated during the training process in a way that by the end of the training only the ReLU activations remain active in the network and the other activations can be removed. This means that in inference time the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · COVID-19 diagnosis using AI · Domain Adaptation and Few-Shot Learning