Deep Residual Networks with Exponential Linear Unit

Anish Shah; Eashan Kadam; Hena Shah; Sameer Shinde; Sandip Shingade

arXiv:1604.04112·cs.CV·October 6, 2016

Deep Residual Networks with Exponential Linear Unit

Anish Shah, Eashan Kadam, Hena Shah, Sameer Shinde, Sandip Shingade

PDF

1 Repo

TL;DR

This paper introduces exponential linear units into residual networks, enhancing training speed and accuracy, and demonstrating improved performance on multiple benchmark datasets.

Contribution

It proposes replacing ReLU and Batch Normalization with exponential linear units in residual networks, leading to faster learning and better accuracy.

Findings

01

Faster training convergence in residual networks.

02

Improved test accuracy on CIFAR-10 and CIFAR-100.

03

Enhanced performance on benchmarks like ImageNet and COCO.

Abstract

Very deep convolutional neural networks introduced new problems like vanishing gradient and degradation. The recent successful contributions towards solving these problems are Residual and Highway Networks. These networks introduce skip connections that allow the information (from the input or those learned in earlier layers) to flow more into the deeper layers. These very deep models have lead to a considerable decrease in test errors, on benchmarks like ImageNet and COCO. In this paper, we propose the use of exponential linear unit instead of the combination of ReLU and Batch Normalization in Residual Networks. We show that this not only speeds up learning in Residual Networks but also improves the accuracy as the depth increases. It improves the test error on almost all data sets, like CIFAR-10 and CIFAR-100

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Amihaeseisergiu/Cifar-10-ResNet-ELU-Cutout
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAverage Pooling · Residual Connection · 1x1 Convolution · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Convolution · Bitcoin Customer Service Number +1-833-534-1729