Stable ResNet

Soufiane Hayou; Eugenio Clerico; Bobby He; George Deligiannidis,; Arnaud Doucet; Judith Rousseau

arXiv:2010.12859·cs.LG·March 19, 2021

Stable ResNet

Soufiane Hayou, Eugenio Clerico, Bobby He, George Deligiannidis,, Arnaud Doucet, Judith Rousseau

PDF

Open Access

TL;DR

This paper introduces Stable ResNet, a new architecture designed to maintain stable gradients and expressivity even as the network depth approaches infinity, addressing key limitations of traditional ResNets.

Contribution

The paper proposes Stable ResNet architectures that stabilize gradients and preserve expressivity in very deep networks, improving upon existing ResNet models.

Findings

01

Stable ResNet stabilizes gradients in deep networks.

02

Stable ResNet maintains expressivity at infinite depth.

03

The architecture outperforms traditional ResNets in stability and expressivity.

Abstract

Deep ResNet architectures have achieved state of the art performance on many tasks. While they solve the problem of gradient vanishing, they might suffer from gradient exploding as the depth becomes large (Yang et al. 2017). Moreover, recent results have shown that ResNet might lose expressivity as the depth goes to infinity (Yang et al. 2017, Hayou et al. 2019). To resolve these issues, we introduce a new class of ResNet architectures, called Stable ResNet, that have the property of stabilizing the gradient while ensuring expressivity in the infinite depth limit.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning

MethodsAverage Pooling · Residual Connection · Convolution · Kaiming Initialization · Global Average Pooling · 1x1 Convolution · Residual Block · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Bottleneck Residual Block