Improving neural networks with bunches of neurons modeled by Kumaraswamy units: Preliminary study

Jakub Mikolaj Tomczak

arXiv:1505.02581·cs.LG·May 12, 2015·1 cites

TL;DR

This paper introduces the Kumaraswamy unit, a new non-linear activation function derived from the generalized Kumaraswamy distribution, and reports lower test classification error and test cross-entropy than the sigmoid unit for a single-layer network on MNIST.

Contribution

It proposes a novel activation function, the Kumaraswamy unit, obtained by modeling a bunch of copies of the same neuron with the generalized Kumaraswamy distribution, and evaluates it in a single-layer neural network on MNIST.

Findings

01. Significant reduction in test classification error with Kumaraswamy units

02. Lower test cross-entropy compared to ReLU and sigmoid

03. Effective in a shallow (single-layer) neural network on MNIST

Abstract

Deep neural networks have recently achieved state-of-the-art results in many machine learning problems, e.g., speech recognition or object recognition. Work on rectified linear units (ReLU) has so far provided empirical and theoretical evidence that they improve the performance of neural networks compared to the typically used sigmoid activation function. In this paper, we investigate a new way of improving neural networks by introducing a bunch of copies of the same neuron modeled by the generalized Kumaraswamy distribution. As a result, we propose a novel non-linear activation function, which we refer to as the Kumaraswamy unit, that is closely related to ReLU. In an experimental study on the MNIST image corpus, we evaluate the Kumaraswamy unit applied to a single-layer (shallow) neural network and report a significant drop in test classification error and test cross-entropy in comparison to the sigmoid unit,…
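
The abstract does not state the functional form of the unit. As a rough illustration only, the sketch below applies the standard Kumaraswamy CDF, F(u; a, b) = 1 - (1 - u^a)^b on [0, 1], to a sigmoid output. Composing the two in this way, the name `kumaraswamy_unit`, and the default values of `a` and `b` are assumptions made for this example, not details taken from the source.

```python
import math


def sigmoid(x: float) -> float:
    """Numerically stable logistic sigmoid."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def kumaraswamy_cdf(u: float, a: float, b: float) -> float:
    """CDF of the Kumaraswamy(a, b) distribution on [0, 1]: 1 - (1 - u^a)^b."""
    if not 0.0 <= u <= 1.0:
        raise ValueError("u must lie in [0, 1]")
    return 1.0 - (1.0 - u ** a) ** b


def kumaraswamy_unit(x: float, a: float = 2.0, b: float = 2.0) -> float:
    """Hypothetical activation: Kumaraswamy CDF applied to sigmoid(x).

    The shape parameters a and b are illustrative defaults (assumption),
    not values reported in the source.
    """
    return kumaraswamy_cdf(sigmoid(x), a, b)


if __name__ == "__main__":
    # Compare the plain sigmoid with the sketched unit at a few inputs.
    for x in (-2.0, 0.0, 2.0):
        print(x, sigmoid(x), kumaraswamy_unit(x))
```

With a = b = 1 the sketch reduces to the plain sigmoid, which gives a simple sanity check for any implementation along these lines.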

Taxonomy

Topics: Neural Networks and Applications · Machine Learning and Algorithms · Mineral Processing and Grinding

Methods: Sigmoid Activation