Nonlinearity Enhanced Adaptive Activation Functions

David Yevick

arXiv:2403.19896 · cs.LG · May 14, 2025 · 1 citation

TL;DR

This paper introduces a general method for adding learned, parametric nonlinearity to activation functions. The method improves neural network accuracy on the MNIST digit data set and on a CNN benchmark with little extra computation.

Contribution

The paper proposes a general procedure for building parametric, learned nonlinearity into activation functions, with examples based on ReLU and several other common activation functions.

Findings

1. Improved accuracy on the MNIST digit data set
2. Improved accuracy on a CNN benchmark
3. Minimal additional computational cost

Abstract

A general procedure for introducing parametric, learned nonlinearity into activation functions is found to enhance the accuracy of representative neural networks without requiring significant additional computational resources. Examples are given based on the standard rectified linear unit (ReLU) as well as several other frequently employed activation functions. The associated accuracy improvement is quantified both in the context of the MNIST digit data set and a convolutional neural network (CNN) benchmark example.
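
To make the idea of a parametric, learned nonlinearity concrete, the following is a minimal sketch in plain Python. It is not the paper's formulation: the cubic correction term, the names `enhanced_relu`, `grad_alpha`, and `fit_alpha`, and the toy fitting loop are all assumptions chosen for illustration. The sketch adds a single trainable coefficient `alpha` to ReLU and fits it by gradient descent alongside what would normally be the network weights.

```python
def relu(x: float) -> float:
    """Standard rectified linear unit."""
    return x if x > 0.0 else 0.0


def enhanced_relu(x: float, alpha: float) -> float:
    """ReLU plus a trainable cubic term.

    Hypothetical form used only for illustration; the abstract does not
    specify the functional form of the added nonlinearity.
    """
    r = relu(x)
    return r + alpha * r ** 3


def grad_alpha(x: float) -> float:
    """Partial derivative of enhanced_relu with respect to alpha."""
    return relu(x) ** 3


def fit_alpha(xs, targets, alpha=0.0, lr=0.05, steps=200):
    """Gradient descent on mean squared error, updating only alpha."""
    n = len(xs)
    for _ in range(steps):
        g = 0.0
        for x, t in zip(xs, targets):
            err = enhanced_relu(x, alpha) - t
            g += 2.0 * err * grad_alpha(x) / n
        alpha -= lr * g
    return alpha


if __name__ == "__main__":
    # Toy data generated with a known coefficient, so recovery can be checked.
    xs = [-1.0, 0.0, 0.5, 1.0, 1.5]
    targets = [enhanced_relu(x, 0.3) for x in xs]
    print(fit_alpha(xs, targets))
```

With `alpha = 0` the function reduces to ordinary ReLU, so a parameterization of this kind can only match or extend the baseline activation, and the extra cost per unit is one additional multiply-add and one additional trainable scalar.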

Taxonomy

Topics: Neural Networks and Applications

Methods: Sparse Evolutionary Training