Learning activation functions from data using cubic spline interpolation

Simone Scardapane, Michele Scarpiniti, Danilo Comminiello, Aurelio Uncini

arXiv:1605.05509 · stat.ML · June 24, 2020

TL;DR

This paper introduces a data-dependent method for adapting activation functions in neural networks using cubic spline interpolation. Each neuron learns the shape of its own activation function during training, which improves performance.

Contribution

The paper proposes an efficient approach to neuron-specific, data-driven adaptation of activation functions, based on cubic spline interpolation combined with a damping criterion.

Findings

01. Improved performance on benchmark datasets.

02. Neuron-specific activation functions outperform fixed functions.

03. The method is computationally efficient and adaptable.

Abstract

Neural networks require careful design in order to perform properly on a given task. In particular, selecting a good activation function (possibly in a data-dependent fashion) is a crucial step, which remains an open problem in the research community. Despite a large number of investigations, most current implementations simply select one fixed function from a small set of candidates, which is not adapted during training and is shared among all neurons throughout the different layers. However, neither of these two assumptions can be assumed optimal in practice. In this paper, we present a principled way to have data-dependent adaptation of the activation functions, which is performed independently for each neuron. This is achieved by leveraging past and present advances in cubic spline interpolation, allowing for local adaptation of the functions around their regions of use.…
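
The idea of a per-neuron spline activation can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation. Several choices are assumptions made here for illustration: the Catmull-Rom basis, the grid range and step, the tanh initialization, and the squared-distance form of the damping term. The names `init_control_points`, `spline_activation`, and `damping_penalty` are hypothetical.

```python
import math

# Catmull-Rom basis matrix (an assumed choice of cubic spline basis).
CR_BASIS = [
    [-0.5,  1.5, -1.5,  0.5],
    [ 1.0, -2.5,  2.0, -0.5],
    [-0.5,  0.0,  0.5,  0.0],
    [ 0.0,  1.0,  0.0,  0.0],
]


def init_control_points(lo=-2.0, hi=2.0, step=0.2, base=math.tanh):
    """Sample a base activation on a uniform grid (assumed range and step)."""
    n = int(round((hi - lo) / step)) + 1
    knots = [lo + i * step for i in range(n)]
    return knots, [base(x) for x in knots]


def spline_activation(s, knots, q):
    """Evaluate the spline at s from the four control points around it."""
    step = knots[1] - knots[0]
    # Index of the knot at or just below s, clamped so q[i-1]..q[i+2] exist.
    # Inputs near or beyond the grid edges are extrapolated from the
    # outermost interior segment.
    i = int(math.floor((s - knots[0]) / step))
    i = max(1, min(i, len(q) - 3))
    u = (s - knots[i]) / step  # local coordinate, in [0, 1] inside the grid
    powers = [u ** 3, u ** 2, u, 1.0]
    # Weight of each control point: row vector of powers times the basis.
    weights = [sum(powers[r] * CR_BASIS[r][c] for r in range(4))
               for c in range(4)]
    return sum(w * q[i - 1 + c] for c, w in enumerate(weights))


def damping_penalty(q, q_init):
    """Assumed damping term: squared distance from the initial control points."""
    return sum((a - b) ** 2 for a, b in zip(q, q_init))


if __name__ == "__main__":
    knots, q0 = init_control_points()
    q = list(q0)          # in training, q would be the learnable parameters
    q[15] += 0.5          # perturb one control point of this neuron
    print(spline_activation(1.0, knots, q), spline_activation(-1.0, knots, q))
    print(damping_penalty(q, q0))
```

Because each output depends on only four neighbouring control points, changing one control point alters the function only in a small region around it. This is one way to read the abstract's "local adaptation of the functions around their regions of use". In a network, each neuron would hold its own list `q`, and a penalty such as `damping_penalty` would be added to the training loss to keep the learned shapes close to their initialization.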

Datasets

Kylan12/Synthetic-AI-ML-Dataset
dataset · 42 dl

Taxonomy

Topics: Model Reduction and Neural Networks · Neural Networks and Applications · Machine Learning and ELM