Competition-based Adaptive ReLU for Deep Neural Networks

Junjia Chen; Zhibin Pan

arXiv:2407.19441·cs.NE·July 30, 2024

Competition-based Adaptive ReLU for Deep Neural Networks

Junjia Chen, Zhibin Pan

PDF

Open Access

TL;DR

This paper introduces CAReLU, a novel activation function that models competition between positive and negative inputs, improving performance across various deep learning tasks by adaptively scaling inputs.

Contribution

The paper proposes CAReLU, a new adaptive activation function that incorporates competition between positive and negative values, trained jointly with network parameters.

Findings

01

CAReLU outperforms traditional activation functions in multiple tasks.

02

Replacing ReLU with CAReLU in ResNet-18 improves CIFAR-100 accuracy.

03

CAReLU offers a new perspective on activation function design through competition modeling.

Abstract

Activation functions introduce nonlinearity into deep neural networks. Most popular activation functions allow positive values to pass through while blocking or suppressing negative values. From the idea that positive values and negative values are equally important, and they must compete for activation, we proposed a new Competition-based Adaptive ReLU (CAReLU). CAReLU scales the input values based on the competition results between positive values and negative values. It defines two parameters to adjust the scaling strategy and can be trained uniformly with other network parameters. We verify the effectiveness of CAReLU on image classification, super-resolution, and natural language processing tasks. In the experiment, our method performs better than other widely used activation functions. In the case of replacing ReLU in ResNet-18 with our proposed activation function, it improves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

Methods*Communicated@Fast*How Do I Communicate to Expedia?