Empirical study of the modulus as activation function in computer vision   applications

Iv\'an Vall\'es-P\'erez; Emilio Soria-Olivas; Marcelino; Mart\'inez-Sober; Antonio J. Serrano-L\'opez; Joan Vila-Franc\'es; Juan; G\'omez-Sanch\'is

arXiv:2301.05993·cs.CV·January 18, 2023

Empirical study of the modulus as activation function in computer vision applications

Iv\'an Vall\'es-P\'erez, Emilio Soria-Olivas, Marcelino, Mart\'inez-Sober, Antonio J. Serrano-L\'opez, Joan Vila-Franc\'es, Juan, G\'omez-Sanch\'is

PDF

Open Access 1 Repo

TL;DR

This paper introduces the modulus activation function for computer vision, demonstrating it improves generalization, avoids vanishing gradients, and is suitable for hardware applications, outperforming traditional nonlinearities.

Contribution

It proposes a new non-monotonic activation function, the modulus, and empirically shows its advantages in accuracy and gradient behavior in vision tasks.

Findings

01

Up to 15% accuracy increase on CIFAR100

02

Eliminates vanishing gradient and dying neurons problems

03

Suitable for TinyML and hardware implementations

Abstract

In this work we propose a new non-monotonic activation function: the modulus. The majority of the reported research on nonlinearities is focused on monotonic functions. We empirically demonstrate how by using the modulus activation function on computer vision tasks the models generalize better than with other nonlinearities - up to a 15% accuracy increase in CIFAR100 and 4% in CIFAR10, relative to the best of the benchmark activations tested. With the proposed activation function the vanishing gradient and dying neurons problems disappear, because the derivative of the activation function is always 1 or -1. The simplicity of the proposed function and its derivative make this solution specially suitable for TinyML and hardware applications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ivallesp/abs
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors · Neural Networks and Applications · Image Processing Techniques and Applications