Activation Functions in Artificial Neural Networks: A Systematic Overview

Johannes Lederer

arXiv:2101.09957 · cs.LG · January 26, 2021 · 45 cites

TL;DR

This paper provides a comprehensive and current overview of various activation functions used in neural networks, highlighting their properties and significance in deep learning.

Contribution

It offers an analytic synthesis of both traditional and recent activation functions, clarifying their roles and differences in neural network performance.

Findings

01 Summarizes key properties of popular activation functions

02 Highlights the proliferation of new activation functions in deep learning

03 Serves as a resource for researchers and practitioners

Abstract

Activation functions shape the outputs of artificial neurons and, therefore, are integral parts of neural networks in general and deep learning in particular. Some activation functions, such as logistic and relu, have been used for many decades. But with deep learning becoming a mainstream research topic, new activation functions have mushroomed, leading to confusion in both theory and practice. This paper provides an analytic yet up-to-date overview of popular activation functions and their properties, which makes it a timely resource for anyone who studies or applies neural networks.
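As a rough illustration of how an activation function shapes a neuron's output, the sketch below applies the two functions named in the abstract, logistic and relu, to the same weighted sum. The `neuron` helper and the input, weight, and bias values are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import math


def logistic(z: float) -> float:
    """Logistic (sigmoid) activation: maps any real input into the range 0 to 1."""
    # Branch on the sign of z so math.exp never receives a large positive argument.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def relu(z: float) -> float:
    """Rectified linear unit: keeps positive inputs and maps the rest to zero."""
    return max(0.0, z)


def neuron(inputs, weights, bias, activation):
    """Hypothetical single neuron: activation applied to a weighted sum plus bias."""
    pre_activation = sum(w * x for w, x in zip(weights, inputs)) + bias
    return activation(pre_activation)


# Illustrative values only (assumptions, not from the paper).
inputs = [1.0, -2.0]
weights = [0.5, 0.25]
bias = -0.5

out_logistic = neuron(inputs, weights, bias, logistic)
out_relu = neuron(inputs, weights, bias, relu)
print(out_logistic, out_relu)
```

The pre-activation value is the same in both calls; only the activation differs, so the two outputs show how the choice of function alone changes what the neuron emits.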

Taxonomy

Topics: Neural Networks and Applications · Machine Learning and ELM · Advanced Neural Network Applications