PEA: Improving the Performance of ReLU Networks for Free by Using Progressive Ensemble Activations
\'Akos Utasi

TL;DR
This paper introduces a progressive ensemble activation method that combines ReLU with novel activations during training, ultimately simplifying the network to ReLU-only for inference while improving performance.
Contribution
It proposes a novel training approach where ensemble activations are progressively pruned, enhancing ReLU networks without increasing inference complexity.
Findings
Achieved 0.2-0.8% top-1 accuracy improvement on ImageNet.
Demonstrated 0.34% mIOU boost in semantic segmentation.
Validated effectiveness across various architectures and tasks.
Abstract
In recent years novel activation functions have been proposed to improve the performance of neural networks, and they show superior performance compared to the ReLU counterpart. However, there are environments, where the availability of complex activations is limited, and usually only the ReLU is supported. In this paper we propose methods that can be used to improve the performance of ReLU networks by using these efficient novel activations during model training. More specifically, we propose ensemble activations that are composed of the ReLU and one of these novel activations. Furthermore, the coefficients of the ensemble are neither fixed nor learned, but are progressively updated during the training process in a way that by the end of the training only the ReLU activations remain active in the network and the other activations can be removed. This means that in inference time the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · COVID-19 diagnosis using AI · Domain Adaptation and Few-Shot Learning
