ShaResNet: reducing residual network parameter number by sharing weights

Alexandre Boulch

arXiv:1702.08782·cs.CV·March 7, 2017·19 cites

ShaResNet: reducing residual network parameter number by sharing weights

Alexandre Boulch

PDF

Open Access 2 Repos

TL;DR

ShaResNet introduces weight sharing among residual blocks at the same scale to significantly reduce parameters while maintaining near-original accuracy, making deep residual networks more efficient.

Contribution

The paper proposes ShaResNet, a novel architecture that shares convolutional weights across residual blocks at the same scale, reducing parameters without substantial accuracy loss.

Findings

01

Achieves 39% parameter reduction in 152-layer ResNet with minimal accuracy loss

02

Maintains near-original performance with fewer parameters compared to standard ResNet

03

More efficient than ResNet with the same number of parameters

Abstract

Deep Residual Networks have reached the state of the art in many image processing tasks such image classification. However, the cost for a gain in accuracy in terms of depth and memory is prohibitive as it requires a higher number of residual blocks, up to double the initial value. To tackle this problem, we propose in this paper a way to reduce the redundant information of the networks. We share the weights of convolutional layers between residual blocks operating at the same spatial scale. The signal flows multiple times in the same convolutional layer. The resulting architecture, called ShaResNet, contains block specific layers and shared layers. These ShaResNet are trained exactly in the same fashion as the commonly used residual networks. We show, on the one hand, that they are almost as efficient as their sequential counterparts while involving less parameters, and on the other…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning