Multi-Residual Networks: Improving the Speed and Accuracy of Residual Networks
Masoud Abdi, Saeid Nahavandi

TL;DR
This paper introduces multi-residual networks that enhance residual network architectures by increasing residual functions, leading to wider models that improve accuracy and computational efficiency on image classification tasks.
Contribution
The paper proposes a novel multi-residual network architecture that explicitly exploits ensemble behavior, resulting in wider models with better accuracy and efficiency.
Findings
Achieved 3.73% error on CIFAR-10
Outperformed deep residual networks on ImageNet by 0.22%
Implemented model parallelism with 15% computational gains
Abstract
In this article, we take one step toward understanding the learning behavior of deep residual networks, and supporting the observation that deep residual networks behave like ensembles. We propose a new convolutional neural network architecture which builds upon the success of residual networks by explicitly exploiting the interpretation of very deep networks as an ensemble. The proposed multi-residual network increases the number of residual functions in the residual blocks. Our architecture generates models that are wider, rather than deeper, which significantly improves accuracy. We show that our model achieves an error rate of 3.73% and 19.45% on CIFAR-10 and CIFAR-100 respectively, that outperforms almost all of the existing models. We also demonstrate that our model outperforms very deep residual networks by 0.22% (top-1 error) on the full ImageNet 2012 classification dataset.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
