Activation function dependence of the storage capacity of treelike   neural networks

Jacob A. Zavatone-Veth; Cengiz Pehlevan

arXiv:2007.11136·cond-mat.dis-nn·February 24, 2021

Activation function dependence of the storage capacity of treelike neural networks

Jacob A. Zavatone-Veth, Cengiz Pehlevan

PDF

Open Access

TL;DR

This paper investigates how different activation functions influence the storage capacity of treelike neural networks, revealing that nonlinearity impacts both capacity and robustness, and offering estimates for common functions.

Contribution

It provides a detailed analysis of the relationship between activation function smoothness and network capacity, connecting theoretical insights to biological neuron behavior.

Findings

01

Nonlinearity can increase storage capacity.

02

Nonlinearity can decrease robustness.

03

Provides estimates for common activation functions.

Abstract

The expressive power of artificial neural networks crucially depends on the nonlinearity of their activation functions. Though a wide variety of nonlinear activation functions have been proposed for use in artificial neural networks, a detailed understanding of their role in determining the expressive power of a network has not emerged. Here, we study how activation functions affect the storage capacity of treelike two-layer networks. We relate the boundedness or divergence of the capacity in the infinite-width limit to the smoothness of the activation function, elucidating the relationship between previously studied special cases. Our results show that nonlinearity can both increase capacity and decrease the robustness of classification, and provide simple estimates for the capacity of networks with several commonly used activation functions. Furthermore, they generate a hypothesis for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural dynamics and brain function · Advanced Fluorescence Microscopy Techniques · Neural Networks and Reservoir Computing