Parametric Variational Linear Units (PVLUs) in Deep Convolutional Networks
Aarush Gupta, Shikhar Ahuja

TL;DR
This paper introduces the Parametric Variational Linear Unit (PVLU), a novel activation function that enhances deep CNN performance by reducing error rates through added nonlinearity and fine-tuning capabilities, especially in transfer learning scenarios.
Contribution
The paper proposes PVLU, a new activation function combining ReLU with a trainable sinusoidal component, improving accuracy and transfer learning performance in deep CNNs.
Findings
PVLU reduces error rates by up to 16.3% on CIFAR-100.
PVLU improves transfer learning models like VGG-16 and VGG-19 on CIFAR datasets.
Fine-tuning with PVLU enhances residual network accuracy by over 10%.
Abstract
The Rectified Linear Unit is currently a state-of-the-art activation function in deep convolutional neural networks. To combat ReLU's dying neuron problem, we propose the Parametric Variational Linear Unit (PVLU), which adds a sinusoidal function with trainable coefficients to ReLU. Along with introducing nonlinearity and non-zero gradients across the entire real domain, PVLU acts as a mechanism of fine-tuning when implemented in the context of transfer learning. On a simple, non-transfer sequential CNN, PVLU substitution allowed for relative error decreases of 16.3% and 11.3% (without and with data augmentation) on CIFAR-100. PVLU is also tested on transfer learning models. The VGG-16 and VGG-19 models experience relative error reductions of 9.5% and 10.7% on CIFAR-10, respectively, after the substitution of ReLU with PVLU. When training on Gaussian-filtered CIFAR-10 images, similar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Visual Geometry Group 19 Layer CNN · Softmax · Kaiming Initialization · Residual Connection · 1x1 Convolution · Dense Connections · Batch Normalization · Convolution · Average Pooling
