Growing Cosine Unit: A Novel Oscillatory Activation Function That Can Speedup Training and Reduce Parameters in Convolutional Neural Networks
Mathew Mithra Noel, Arunkumar L, Advait Trivedi, Praneet Dutta

TL;DR
This paper introduces the Growing Cosine Unit (GCU), an oscillatory activation function that enhances gradient flow, reduces network parameters, and improves performance in convolutional neural networks across multiple benchmarks.
Contribution
The paper proposes the GCU activation function, provides theoretical theorems on non-oscillatory functions, and demonstrates its superiority over existing activations in CNNs.
Findings
GCU outperforms ReLU, Sigmoid, Swish, and Mish on benchmarks.
GCU enables single neurons to learn XOR without feature engineering.
Replacing activations with GCU improves CNN performance on CIFAR datasets.
Abstract
Convolutional neural networks have been successful in solving many socially important and economically significant problems. This ability to learn complex high-dimensional functions hierarchically can be attributed to the use of nonlinear activation functions. A key discovery that made training deep networks feasible was the adoption of the Rectified Linear Unit (ReLU) activation function to alleviate the vanishing gradient problem caused by using saturating activation functions. Since then, many improved variants of the ReLU activation have been proposed. However, a majority of activation functions used today are non-oscillatory and monotonically increasing due to their biological plausibility. This paper demonstrates that oscillatory activation functions can improve gradient flow and reduce network size. Two theorems on limits of non-oscillatory activation functions are presented. A…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpectroscopy Techniques in Biomedical and Chemical Research · Model Reduction and Neural Networks · Neural Networks and Reservoir Computing
MethodsGrowing Cosine Unit · Sigmoid Activation · Tanh Activation
